Dataset statistics
| Number of variables | 36 |
|---|---|
| Number of observations | 78033 |
| Missing cells | 649732 |
| Missing cells (%) | 23.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 21.4 MiB |
| Average record size in memory | 288.0 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 24 |
| Unsupported | 2 |
Closed has constant value "0" | Constant |
Name has a high cardinality: 22710 distinct values | High cardinality |
Address has a high cardinality: 6618 distinct values | High cardinality |
StreetName has a high cardinality: 669 distinct values | High cardinality |
BldgNo has a high cardinality: 94 distinct values | High cardinality |
UnitNo has a high cardinality: 3335 distinct values | High cardinality |
PostalCode has a high cardinality: 2902 distinct values | High cardinality |
Location has a high cardinality: 56 distinct values | High cardinality |
NAICSDescr has a high cardinality: 1041 distinct values | High cardinality |
Phone has a high cardinality: 25064 distinct values | High cardinality |
Fax has a high cardinality: 15752 distinct values | High cardinality |
TollFree has a high cardinality: 4117 distinct values | High cardinality |
EMail has a high cardinality: 15058 distinct values | High cardinality |
WebAddress has a high cardinality: 14200 distinct values | High cardinality |
EmplUpdate has a high cardinality: 433 distinct values | High cardinality |
Character has a high cardinality: 56 distinct values | High cardinality |
CHArea has a high cardinality: 57 distinct values | High cardinality |
Modified has a high cardinality: 189 distinct values | High cardinality |
X is highly overall correlated with Y and 1 other fields | High correlation |
Y is highly overall correlated with X and 1 other fields | High correlation |
BusinessID is highly overall correlated with FID and 2 other fields | High correlation |
Ward is highly overall correlated with CENT_X | High correlation |
CENT_X is highly overall correlated with Location and 1 other fields | High correlation |
CENT_Y is highly overall correlated with Location and 1 other fields | High correlation |
Year is highly overall correlated with X and 3 other fields | High correlation |
RecordID is highly overall correlated with FID and 2 other fields | High correlation |
Character is highly overall correlated with FID and 3 other fields | High correlation |
BIA_NAME is highly overall correlated with FID and 2 other fields | High correlation |
EmplRange is highly overall correlated with NAICSCat and 1 other fields | High correlation |
CHArea is highly overall correlated with FID and 5 other fields | High correlation |
Sector_Des is highly overall correlated with NAICSCat | High correlation |
BIAFulName is highly overall correlated with FID and 2 other fields | High correlation |
FID is highly overall correlated with BusinessID and 7 other fields | High correlation |
BldgNo is highly overall correlated with Location and 2 other fields | High correlation |
Location is highly overall correlated with FID and 6 other fields | High correlation |
NAICSCat is highly overall correlated with Location and 5 other fields | High correlation |
PIN is highly overall correlated with FID and 2 other fields | High correlation |
X has 48606 (62.3%) missing values | Missing |
Y has 48606 (62.3%) missing values | Missing |
Location has 47694 (61.1%) missing values | Missing |
EmplRange has 2646 (3.4%) missing values | Missing |
EmplUpdate has 15002 (19.2%) missing values | Missing |
Sector_Des has 63431 (81.3%) missing values | Missing |
CENT_X has 47694 (61.1%) missing values | Missing |
CENT_Y has 47694 (61.1%) missing values | Missing |
PIN has 30339 (38.9%) missing values | Missing |
Character has 61682 (79.0%) missing values | Missing |
CHArea has 46690 (59.8%) missing values | Missing |
Modified has 63218 (81.0%) missing values | Missing |
BIA_NAME has 63208 (81.0%) missing values | Missing |
BIAFulName has 63208 (81.0%) missing values | Missing |
StreetNo is highly skewed (γ1 = 147.6519659) | Skewed |
NAICSCode is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
isnew is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
| Analysis started | 2023-02-20 22:12:05.941189 |
|---|---|
| Analysis finished | 2023-02-20 22:12:34.863102 |
| Duration | 28.92 seconds |
| Software version | pandas-profiling vv3.5.0 |
| Download configuration | config.json |
| Distinct | 8684 |
|---|---|
| Distinct (%) | 29.5% |
| Missing | 48606 |
| Missing (%) | 62.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 306553.47 |
| Minimum | -79.80298 |
|---|---|
| Maximum | 617060.11 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 14602 |
| Negative (%) | 18.7% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | -79.80298 |
|---|---|
| 5-th percentile | -79.716419 |
| Q1 | -79.64992 |
| median | 598535.65 |
| Q3 | 608829.52 |
| 95-th percentile | 613567.3 |
| Maximum | 617060.11 |
| Range | 617139.91 |
| Interquartile range (IQR) | 608909.17 |
Descriptive statistics
| Standard deviation | 304335.28 |
|---|---|
| Coefficient of variation (CV) | 0.99276409 |
| Kurtosis | -1.9996012 |
| Mean | 306553.47 |
| Median Absolute Deviation (MAD) | 17202.025 |
| Skewness | -0.014922506 |
| Sum | 9.0209489 × 109 |
| Variance | 9.261996 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 609566.1112 | 201 | 0.3% |
| -79.64275968 | 185 | 0.2% |
| -79.60364656 | 123 | 0.2% |
| 607701.737 | 121 | 0.2% |
| -79.71222857 | 113 | 0.1% |
| -79.63864759 | 107 | 0.1% |
| 604057.4854 | 101 | 0.1% |
| 609718.3353 | 100 | 0.1% |
| -79.56936408 | 91 | 0.1% |
| 615498.4771 | 66 | 0.1% |
| Other values (8674) | 28219 | |
| (Missing) | 48606 |
| Value | Count | Frequency (%) |
| -79.80298035 | 1 | < 0.1% |
| -79.8014612 | 1 | < 0.1% |
| -79.79447393 | 1 | < 0.1% |
| -79.79439767 | 1 | < 0.1% |
| -79.78884298 | 1 | < 0.1% |
| -79.78871792 | 20 | |
| -79.78850259 | 1 | < 0.1% |
| -79.78675536 | 5 | < 0.1% |
| -79.78630211 | 12 | |
| -79.78452433 | 11 |
| Value | Count | Frequency (%) |
| 617060.1055 | 1 | |
| 616918.4738 | 1 | |
| 616839.6893 | 1 | |
| 616837.5953 | 1 | |
| 616769.3441 | 1 | |
| 616704.5391 | 1 | |
| 616692.2284 | 1 | |
| 616667.6043 | 1 | |
| 616657.8816 | 1 | |
| 616643.3766 | 1 |
| Distinct | 8684 |
|---|---|
| Distinct (%) | 29.5% |
| Missing | 48606 |
| Missing (%) | 62.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2433290.7 |
| Minimum | 43.48517 |
|---|---|
| Maximum | 4843106.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | 43.48517 |
|---|---|
| 5-th percentile | 43.53859 |
| Q1 | 43.608514 |
| median | 4818092 |
| Q3 | 4829966.3 |
| 95-th percentile | 4838021.6 |
| Maximum | 4843106.9 |
| Range | 4843063.4 |
| Interquartile range (IQR) | 4829922.6 |
Descriptive statistics
| Standard deviation | 2414921.5 |
|---|---|
| Coefficient of variation (CV) | 0.99245088 |
| Kurtosis | -1.9998953 |
| Mean | 2433290.7 |
| Median Absolute Deviation (MAD) | 23561.033 |
| Skewness | -0.015148997 |
| Sum | 7.1604446 × 1010 |
| Variance | 5.8318459 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4827535.97 | 201 | 0.3% |
| 43.59351505 | 185 | 0.2% |
| 43.67999884 | 123 | 0.2% |
| 4838234.833 | 121 | 0.2% |
| 43.55837136 | 113 | 0.1% |
| 43.72011759 | 107 | 0.1% |
| 4823601.861 | 101 | 0.1% |
| 4841653.08 | 100 | 0.1% |
| 43.5935916 | 91 | 0.1% |
| 4827677.175 | 66 | 0.1% |
| Other values (8674) | 28219 | |
| (Missing) | 48606 |
| Value | Count | Frequency (%) |
| 43.48517014 | 1 | |
| 43.48968489 | 1 | |
| 43.4915708 | 1 | |
| 43.49199992 | 2 | |
| 43.49224252 | 1 | |
| 43.49454092 | 1 | |
| 43.49517064 | 1 | |
| 43.49608236 | 1 | |
| 43.49636475 | 1 | |
| 43.49652992 | 2 |
| Value | Count | Frequency (%) |
| 4843106.933 | 3 | |
| 4843045.912 | 1 | < 0.1% |
| 4842995.781 | 2 | |
| 4842852.901 | 1 | < 0.1% |
| 4842722.486 | 1 | < 0.1% |
| 4842531.982 | 2 | |
| 4842304.058 | 2 | |
| 4842274.717 | 1 | < 0.1% |
| 4842274.399 | 2 | |
| 4842200.556 | 2 |
FID
Real number (ℝ)
| Distinct | 16518 |
|---|---|
| Distinct (%) | 21.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7823.163 |
| Minimum | 1 |
|---|---|
| Maximum | 16518 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 781 |
| Q1 | 3902 |
| median | 7804 |
| Q3 | 11705 |
| 95-th percentile | 14902 |
| Maximum | 16518 |
| Range | 16517 |
| Interquartile range (IQR) | 7803 |
Descriptive statistics
| Standard deviation | 4538.4885 |
|---|---|
| Coefficient of variation (CV) | 0.58013472 |
| Kurtosis | -1.1665313 |
| Mean | 7823.163 |
| Median Absolute Deviation (MAD) | 3902 |
| Skewness | 0.024778868 |
| Sum | 6.1046488 × 108 |
| Variance | 20597878 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 5 | < 0.1% |
| 9727 | 5 | < 0.1% |
| 9729 | 5 | < 0.1% |
| 9730 | 5 | < 0.1% |
| 9731 | 5 | < 0.1% |
| 9732 | 5 | < 0.1% |
| 9733 | 5 | < 0.1% |
| 9734 | 5 | < 0.1% |
| 9735 | 5 | < 0.1% |
| 9736 | 5 | < 0.1% |
| Other values (16508) | 77983 |
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 2 | 5 | |
| 3 | 5 | |
| 4 | 5 | |
| 5 | 5 | |
| 6 | 5 | |
| 7 | 5 | |
| 8 | 5 | |
| 9 | 5 | |
| 10 | 5 |
| Value | Count | Frequency (%) |
| 16518 | 1 | |
| 16517 | 1 | |
| 16516 | 1 | |
| 16515 | 1 | |
| 16514 | 1 | |
| 16513 | 1 | |
| 16512 | 1 | |
| 16511 | 1 | |
| 16510 | 1 | |
| 16509 | 1 |
BusinessID
Real number (ℝ)
| Distinct | 21240 |
|---|---|
| Distinct (%) | 27.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34656.92 |
| Minimum | 2 |
|---|---|
| Maximum | 94424 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2230 |
| Q1 | 9764 |
| median | 19183 |
| Q3 | 55026 |
| 95-th percentile | 88915 |
| Maximum | 94424 |
| Range | 94422 |
| Interquartile range (IQR) | 45262 |
Descriptive statistics
| Standard deviation | 29857.678 |
|---|---|
| Coefficient of variation (CV) | 0.8615214 |
| Kurtosis | -0.9937126 |
| Mean | 34656.92 |
| Median Absolute Deviation (MAD) | 16020 |
| Skewness | 0.65053975 |
| Sum | 2.7043834 × 109 |
| Variance | 8.9148093 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 85606 | 6 | < 0.1% |
| 1055 | 5 | < 0.1% |
| 19338 | 5 | < 0.1% |
| 19580 | 5 | < 0.1% |
| 20871 | 5 | < 0.1% |
| 19831 | 5 | < 0.1% |
| 19332 | 5 | < 0.1% |
| 19583 | 5 | < 0.1% |
| 19832 | 5 | < 0.1% |
| 19584 | 5 | < 0.1% |
| Other values (21230) | 77982 |
| Value | Count | Frequency (%) |
| 2 | 2 | < 0.1% |
| 7 | 5 | |
| 10 | 5 | |
| 12 | 3 | |
| 16 | 5 | |
| 18 | 5 | |
| 20 | 5 | |
| 21 | 5 | |
| 23 | 5 | |
| 26 | 4 |
| Value | Count | Frequency (%) |
| 94424 | 1 | |
| 94423 | 1 | |
| 94419 | 1 | |
| 94371 | 1 | |
| 94321 | 1 | |
| 94319 | 1 | |
| 94318 | 1 | |
| 94317 | 1 | |
| 94313 | 1 | |
| 94293 | 1 |
Name
Categorical
| Distinct | 22710 |
|---|---|
| Distinct (%) | 29.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| Subway | 212 |
|---|---|
| Tim Hortons | 181 |
| Petro Canada | 123 |
| Shoppers Drug Mart | 102 |
| Tim Horton's | 97 |
| Other values (22705) |
Length
| Max length | 118 |
|---|---|
| Median length | 76 |
| Mean length | 22.654351 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1767787 |
|---|---|
| Distinct characters | 93 |
| Distinct categories | 15 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 3 ? |
Unique
| Unique | 5009 ? |
|---|---|
| Unique (%) | 6.4% |
Sample
| 1st row | Golf Trends Inc. |
|---|---|
| 2nd row | Apex Graphics Inc. |
| 3rd row | Sands, John & Associates Limited |
| 4th row | Printmedia-Tackaberry Times |
| 5th row | S W R Industries Ltd. |
Common Values
| Value | Count | Frequency (%) |
| Subway | 212 | 0.3% |
| Tim Hortons | 181 | 0.2% |
| Petro Canada | 123 | 0.2% |
| Shoppers Drug Mart | 102 | 0.1% |
| Tim Horton's | 97 | 0.1% |
| PLASP Child Care Centre | 96 | 0.1% |
| Dollarama | 92 | 0.1% |
| Starbucks | 88 | 0.1% |
| Shell Canada | 84 | 0.1% |
| Royal Bank of Canada | 78 | 0.1% |
| Other values (22700) | 76880 |
Length
| Value | Count | Frequency (%) |
| inc | 15794 | 5.7% |
| 9127 | 3.3% | |
| ltd | 7946 | 2.9% |
| canada | 4795 | 1.7% |
| centre | 2969 | 1.1% |
| and | 2598 | 0.9% |
| services | 2443 | 0.9% |
| the | 2359 | 0.8% |
| a | 2092 | 0.8% |
| of | 2044 | 0.7% |
| Other values (16113) | 225480 |
Most occurring characters
| Value | Count | Frequency (%) |
| 199928 | 11.3% | |
| e | 132590 | 7.5% |
| a | 128136 | 7.2% |
| n | 115216 | 6.5% |
| i | 104250 | 5.9% |
| r | 101894 | 5.8% |
| o | 97613 | 5.5% |
| t | 94807 | 5.4% |
| s | 77470 | 4.4% |
| l | 62777 | 3.6% |
| Other values (83) | 653106 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1236773 | |
| Uppercase Letter | 275471 | 15.6% |
| Space Separator | 199928 | 11.3% |
| Other Punctuation | 44369 | 2.5% |
| Decimal Number | 4222 | 0.2% |
| Dash Punctuation | 4194 | 0.2% |
| Close Punctuation | 1272 | 0.1% |
| Open Punctuation | 1266 | 0.1% |
| Math Symbol | 178 | < 0.1% |
| Final Punctuation | 99 | < 0.1% |
| Other values (5) | 15 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 132590 | |
| a | 128136 | |
| n | 115216 | |
| i | 104250 | 8.4% |
| r | 101894 | 8.2% |
| o | 97613 | 7.9% |
| t | 94807 | 7.7% |
| s | 77470 | 6.3% |
| l | 62777 | 5.1% |
| c | 60202 | 4.9% |
| Other values (20) | 261818 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 35962 | |
| S | 28667 | 10.4% |
| I | 23883 | 8.7% |
| M | 18396 | 6.7% |
| L | 18129 | 6.6% |
| A | 17083 | 6.2% |
| P | 16975 | 6.2% |
| T | 15559 | 5.6% |
| D | 13515 | 4.9% |
| B | 11145 | 4.0% |
| Other values (17) | 76157 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 29522 | |
| & | 7166 | 16.2% |
| , | 3463 | 7.8% |
| ' | 3108 | 7.0% |
| / | 898 | 2.0% |
| : | 88 | 0.2% |
| # | 35 | 0.1% |
| @ | 29 | 0.1% |
| ! | 26 | 0.1% |
| " | 16 | < 0.1% |
| Other values (2) | 18 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 906 | |
| 2 | 760 | |
| 0 | 712 | |
| 4 | 418 | |
| 3 | 334 | 7.9% |
| 9 | 287 | 6.8% |
| 8 | 245 | 5.8% |
| 7 | 197 | 4.7% |
| 5 | 184 | 4.4% |
| 6 | 179 | 4.2% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 152 | |
| | | 25 | 14.0% |
| > | 1 | 0.6% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1264 | |
| ] | 8 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| 199928 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4194 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1266 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 99 |
Control
| Value | Count | Frequency (%) |
| 6 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3 |
Format
| Value | Count | Frequency (%) |
| | 3 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 2 |
Other Symbol
| Value | Count | Frequency (%) |
| © | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1512244 | |
| Common | 255543 | 14.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 132590 | 8.8% |
| a | 128136 | 8.5% |
| n | 115216 | 7.6% |
| i | 104250 | 6.9% |
| r | 101894 | 6.7% |
| o | 97613 | 6.5% |
| t | 94807 | 6.3% |
| s | 77470 | 5.1% |
| l | 62777 | 4.2% |
| c | 60202 | 4.0% |
| Other values (47) | 537289 |
Common
| Value | Count | Frequency (%) |
| 199928 | ||
| . | 29522 | 11.6% |
| & | 7166 | 2.8% |
| - | 4194 | 1.6% |
| , | 3463 | 1.4% |
| ' | 3108 | 1.2% |
| ( | 1266 | 0.5% |
| ) | 1264 | 0.5% |
| 1 | 906 | 0.4% |
| / | 898 | 0.4% |
| Other values (26) | 3828 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1767609 | |
| Punctuation | 102 | < 0.1% |
| None | 76 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 199928 | 11.3% | |
| e | 132590 | 7.5% |
| a | 128136 | 7.2% |
| n | 115216 | 6.5% |
| i | 104250 | 5.9% |
| r | 101894 | 5.8% |
| o | 97613 | 5.5% |
| t | 94807 | 5.4% |
| s | 77470 | 4.4% |
| l | 62777 | 3.6% |
| Other values (75) | 652928 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 99 | |
| | 3 | 2.9% |
None
| Value | Count | Frequency (%) |
| é | 67 | |
| ü | 4 | 5.3% |
| ē | 2 | 2.6% |
| É | 1 | 1.3% |
| ä | 1 | 1.3% |
| © | 1 | 1.3% |
Address
Categorical
| Distinct | 6618 |
|---|---|
| Distinct (%) | 8.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| 100 City Centre Dr | 954 |
|---|---|
| 5100 Erin Mills Pky | 523 |
| 7205 Goreway Dr | 483 |
| 1250 South Service Rd | 394 |
| 1550 South Gateway Rd | 284 |
| Other values (6613) |
Length
| Max length | 32 |
|---|---|
| Median length | 27 |
| Mean length | 16.625543 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1297341 |
|---|---|
| Distinct characters | 64 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 292 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 300 Ambassador Dr |
|---|---|
| 2nd row | 320 Ambassador Dr |
| 3rd row | 320 Ambassador Dr |
| 4th row | 320 Ambassador Dr |
| 5th row | 321 Ambassador Dr |
Common Values
| Value | Count | Frequency (%) |
| 100 City Centre Dr | 954 | 1.2% |
| 5100 Erin Mills Pky | 523 | 0.7% |
| 7205 Goreway Dr | 483 | 0.6% |
| 1250 South Service Rd | 394 | 0.5% |
| 1550 South Gateway Rd | 284 | 0.4% |
| 4141 Dixie Rd | 248 | 0.3% |
| 2225 Erin Mills Pky | 238 | 0.3% |
| 50 Burnhamthorpe Rd W | 229 | 0.3% |
| 2355 Derry Rd E | 212 | 0.3% |
| 2000 Credit Valley Rd | 212 | 0.3% |
| Other values (6608) | 74256 |
Length
| Value | Count | Frequency (%) |
| rd | 28597 | 10.8% |
| dr | 17908 | 6.8% |
| e | 12047 | 4.6% |
| st | 9954 | 3.8% |
| blvd | 8013 | 3.0% |
| w | 7245 | 2.7% |
| dundas | 4805 | 1.8% |
| ave | 3977 | 1.5% |
| matheson | 2625 | 1.0% |
| pky | 2579 | 1.0% |
| Other values (3761) | 165839 |
Most occurring characters
| Value | Count | Frequency (%) |
| 185559 | 14.3% | |
| r | 77073 | 5.9% |
| e | 71981 | 5.5% |
| a | 58783 | 4.5% |
| d | 55945 | 4.3% |
| 0 | 51080 | 3.9% |
| n | 49723 | 3.8% |
| 5 | 48031 | 3.7% |
| t | 47994 | 3.7% |
| i | 45040 | 3.5% |
| Other values (54) | 606132 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 636955 | |
| Decimal Number | 287143 | |
| Uppercase Letter | 187147 | 14.4% |
| Space Separator | 185559 | 14.3% |
| Dash Punctuation | 480 | < 0.1% |
| Other Punctuation | 54 | < 0.1% |
| Modifier Symbol | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 77073 | |
| e | 71981 | |
| a | 58783 | |
| d | 55945 | |
| n | 49723 | 7.8% |
| t | 47994 | 7.5% |
| i | 45040 | 7.1% |
| o | 36413 | 5.7% |
| l | 32505 | 5.1% |
| s | 27700 | 4.3% |
| Other values (15) | 133798 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 31751 | |
| D | 29024 | |
| S | 18789 | |
| E | 16442 | |
| B | 14485 | |
| C | 13383 | |
| W | 11748 | 6.3% |
| M | 9512 | 5.1% |
| A | 9382 | 5.0% |
| T | 6499 | 3.5% |
| Other values (14) | 26132 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 51080 | |
| 5 | 48031 | |
| 1 | 41653 | |
| 2 | 31311 | |
| 3 | 25187 | |
| 6 | 23265 | |
| 7 | 20531 | |
| 4 | 17381 | 6.1% |
| 9 | 14549 | 5.1% |
| 8 | 14155 | 4.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 46 | |
| . | 8 | 14.8% |
Space Separator
| Value | Count | Frequency (%) |
| 185559 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 480 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 824102 | |
| Common | 473239 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 77073 | 9.4% |
| e | 71981 | 8.7% |
| a | 58783 | 7.1% |
| d | 55945 | 6.8% |
| n | 49723 | 6.0% |
| t | 47994 | 5.8% |
| i | 45040 | 5.5% |
| o | 36413 | 4.4% |
| l | 32505 | 3.9% |
| R | 31751 | 3.9% |
| Other values (39) | 316894 |
Common
| Value | Count | Frequency (%) |
| 185559 | ||
| 0 | 51080 | 10.8% |
| 5 | 48031 | 10.1% |
| 1 | 41653 | 8.8% |
| 2 | 31311 | 6.6% |
| 3 | 25187 | 5.3% |
| 6 | 23265 | 4.9% |
| 7 | 20531 | 4.3% |
| 4 | 17381 | 3.7% |
| 9 | 14549 | 3.1% |
| Other values (5) | 14692 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1297341 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 185559 | 14.3% | |
| r | 77073 | 5.9% |
| e | 71981 | 5.5% |
| a | 58783 | 4.5% |
| d | 55945 | 4.3% |
| 0 | 51080 | 3.9% |
| n | 49723 | 3.8% |
| 5 | 48031 | 3.7% |
| t | 47994 | 3.7% |
| i | 45040 | 3.5% |
| Other values (54) | 606132 |
StreetNo
Real number (ℝ)
| Distinct | 3090 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2946.096 |
| Minimum | 1 |
|---|---|
| Maximum | 905629 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 57 |
| Q1 | 1050 |
| median | 2375 |
| Q3 | 5100 |
| 95-th percentile | 7070 |
| Maximum | 905629 |
| Range | 905628 |
| Interquartile range (IQR) | 4050 |
Descriptive statistics
| Standard deviation | 3997.6535 |
|---|---|
| Coefficient of variation (CV) | 1.3569325 |
| Kurtosis | 33315.386 |
| Mean | 2946.096 |
| Median Absolute Deviation (MAD) | 1655 |
| Skewness | 147.65197 |
| Sum | 2.2989271 × 108 |
| Variance | 15981234 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 1102 | 1.4% |
| 5100 | 601 | 0.8% |
| 7205 | 520 | 0.7% |
| 1250 | 448 | 0.6% |
| 1 | 442 | 0.6% |
| 2000 | 383 | 0.5% |
| 1550 | 359 | 0.5% |
| 50 | 313 | 0.4% |
| 4141 | 310 | 0.4% |
| 2425 | 304 | 0.4% |
| Other values (3080) | 73251 |
| Value | Count | Frequency (%) |
| 1 | 442 | |
| 2 | 198 | |
| 3 | 200 | |
| 4 | 154 | 0.2% |
| 5 | 7 | < 0.1% |
| 6 | 33 | < 0.1% |
| 7 | 25 | < 0.1% |
| 8 | 21 | < 0.1% |
| 9 | 20 | < 0.1% |
| 10 | 154 | 0.2% |
| Value | Count | Frequency (%) |
| 905629 | 1 | < 0.1% |
| 7895 | 138 | |
| 7890 | 7 | < 0.1% |
| 7885 | 79 | |
| 7880 | 6 | < 0.1% |
| 7875 | 30 | < 0.1% |
| 7860 | 5 | < 0.1% |
| 7855 | 5 | < 0.1% |
| 7850 | 4 | < 0.1% |
| 7840 | 1 | < 0.1% |
StreetName
Categorical
| Distinct | 669 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| Dundas St E | 3202 |
|---|---|
| Matheson Blvd E | 2125 |
| Dixie Rd | 1982 |
| Hurontario St | 1971 |
| Lakeshore Rd E | 1628 |
| Other values (664) |
Length
| Max length | 26 |
|---|---|
| Median length | 22 |
| Mean length | 11.945062 |
| Min length | 3 |
Characters and Unicode
| Total characters | 932109 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 57 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Ambassador Dr |
|---|---|
| 2nd row | Ambassador Dr |
| 3rd row | Ambassador Dr |
| 4th row | Ambassador Dr |
| 5th row | Ambassador Dr |
Common Values
| Value | Count | Frequency (%) |
| Dundas St E | 3202 | 4.1% |
| Matheson Blvd E | 2125 | 2.7% |
| Dixie Rd | 1982 | 2.5% |
| Hurontario St | 1971 | 2.5% |
| Lakeshore Rd E | 1628 | 2.1% |
| Dundas St W | 1586 | 2.0% |
| City Centre Dr | 1529 | 2.0% |
| Britannia Rd E | 1441 | 1.8% |
| Tomken Rd | 1416 | 1.8% |
| Argentia Rd | 1400 | 1.8% |
| Other values (659) | 59753 |
Length
| Value | Count | Frequency (%) |
| rd | 28598 | 15.4% |
| dr | 17907 | 9.7% |
| e | 12045 | 6.5% |
| st | 9954 | 5.4% |
| blvd | 8011 | 4.3% |
| w | 7247 | 3.9% |
| dundas | 4805 | 2.6% |
| ave | 3978 | 2.1% |
| matheson | 2625 | 1.4% |
| pky | 2575 | 1.4% |
| Other values (665) | 87804 |
Most occurring characters
| Value | Count | Frequency (%) |
| 107517 | 11.5% | |
| r | 77033 | 8.3% |
| e | 71982 | 7.7% |
| a | 58785 | 6.3% |
| d | 55948 | 6.0% |
| n | 49726 | 5.3% |
| t | 47988 | 5.1% |
| i | 45032 | 4.8% |
| o | 36410 | 3.9% |
| l | 32503 | 3.5% |
| Other values (43) | 349185 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 636932 | |
| Uppercase Letter | 187129 | 20.1% |
| Space Separator | 107517 | 11.5% |
| Dash Punctuation | 480 | 0.1% |
| Other Punctuation | 51 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 77033 | |
| e | 71982 | |
| a | 58785 | |
| d | 55948 | |
| n | 49726 | 7.8% |
| t | 47988 | 7.5% |
| i | 45032 | 7.1% |
| o | 36410 | 5.7% |
| l | 32503 | 5.1% |
| s | 27702 | 4.3% |
| Other values (15) | 133823 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 31747 | |
| D | 29018 | |
| S | 18788 | |
| E | 16439 | |
| B | 14481 | |
| C | 13376 | |
| W | 11747 | 6.3% |
| M | 9514 | 5.1% |
| A | 9382 | 5.0% |
| T | 6500 | 3.5% |
| Other values (14) | 26137 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 45 | |
| . | 6 | 11.8% |
Space Separator
| Value | Count | Frequency (%) |
| 107517 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 480 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 824061 | |
| Common | 108048 | 11.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 77033 | 9.3% |
| e | 71982 | 8.7% |
| a | 58785 | 7.1% |
| d | 55948 | 6.8% |
| n | 49726 | 6.0% |
| t | 47988 | 5.8% |
| i | 45032 | 5.5% |
| o | 36410 | 4.4% |
| l | 32503 | 3.9% |
| R | 31747 | 3.9% |
| Other values (39) | 316907 |
Common
| Value | Count | Frequency (%) |
| 107517 | ||
| - | 480 | 0.4% |
| ' | 45 | < 0.1% |
| . | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 932109 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 107517 | 11.5% | |
| r | 77033 | 8.3% |
| e | 71982 | 7.7% |
| a | 58785 | 6.3% |
| d | 55948 | 6.0% |
| n | 49726 | 5.3% |
| t | 47988 | 5.1% |
| i | 45032 | 4.8% |
| o | 36410 | 3.9% |
| l | 32503 | 3.5% |
| Other values (43) | 349185 |
| Distinct | 94 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| Bldg 2 | 897 |
|---|---|
| Bldg 1 | 858 |
| Bldg A | 426 |
| Bldg B | 348 |
| Other values (89) | 1705 |
Length
| Max length | 18 |
|---|---|
| Median length | 1 |
| Mean length | 1.2798303 |
| Min length | 1 |
Characters and Unicode
| Total characters | 99869 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 24 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| 73799 | ||
| Bldg 2 | 897 | 1.1% |
| Bldg 1 | 858 | 1.1% |
| Bldg A | 426 | 0.5% |
| Bldg B | 348 | 0.4% |
| Bldg 3 | 292 | 0.4% |
| Bldg 4 | 221 | 0.3% |
| Bldg K | 135 | 0.2% |
| Bldg C | 97 | 0.1% |
| East Tower | 67 | 0.1% |
| Other values (84) | 893 | 1.1% |
Length
| Value | Count | Frequency (%) |
| bldg | 3720 | |
| 1 | 943 | 11.3% |
| 2 | 941 | 11.2% |
| a | 448 | 5.3% |
| b | 372 | 4.4% |
| 3 | 321 | 3.8% |
| 4 | 276 | 3.3% |
| plaza | 169 | 2.0% |
| k | 135 | 1.6% |
| tower | 118 | 1.4% |
| Other values (58) | 931 | 11.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 77940 | ||
| B | 4161 | 4.2% |
| l | 3969 | 4.0% |
| g | 3806 | 3.8% |
| d | 3752 | 3.8% |
| 1 | 1103 | 1.1% |
| 2 | 1002 | 1.0% |
| a | 514 | 0.5% |
| A | 454 | 0.5% |
| 3 | 326 | 0.3% |
| Other values (43) | 2842 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 77940 | |
| Lowercase Letter | 13394 | 13.4% |
| Uppercase Letter | 5595 | 5.6% |
| Decimal Number | 2933 | 2.9% |
| Other Punctuation | 5 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 4161 | |
| A | 454 | 8.1% |
| P | 170 | 3.0% |
| K | 135 | 2.4% |
| E | 119 | 2.1% |
| T | 115 | 2.1% |
| C | 106 | 1.9% |
| H | 83 | 1.5% |
| D | 57 | 1.0% |
| W | 51 | 0.9% |
| Other values (10) | 144 | 2.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 3969 | |
| g | 3806 | |
| d | 3752 | |
| a | 514 | 3.8% |
| e | 269 | 2.0% |
| r | 225 | 1.7% |
| z | 169 | 1.3% |
| o | 151 | 1.1% |
| t | 149 | 1.1% |
| s | 121 | 0.9% |
| Other values (10) | 269 | 2.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1103 | |
| 2 | 1002 | |
| 3 | 326 | 11.1% |
| 4 | 279 | 9.5% |
| 9 | 45 | 1.5% |
| 6 | 43 | 1.5% |
| 5 | 40 | 1.4% |
| 7 | 39 | 1.3% |
| 0 | 33 | 1.1% |
| 8 | 23 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| 77940 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 5 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 80880 | |
| Latin | 18989 | 19.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 4161 | |
| l | 3969 | |
| g | 3806 | |
| d | 3752 | |
| a | 514 | 2.7% |
| A | 454 | 2.4% |
| e | 269 | 1.4% |
| r | 225 | 1.2% |
| P | 170 | 0.9% |
| z | 169 | 0.9% |
| Other values (30) | 1500 | 7.9% |
Common
| Value | Count | Frequency (%) |
| 77940 | ||
| 1 | 1103 | 1.4% |
| 2 | 1002 | 1.2% |
| 3 | 326 | 0.4% |
| 4 | 279 | 0.3% |
| 9 | 45 | 0.1% |
| 6 | 43 | 0.1% |
| 5 | 40 | < 0.1% |
| 7 | 39 | < 0.1% |
| 0 | 33 | < 0.1% |
| Other values (3) | 30 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 99869 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 77940 | ||
| B | 4161 | 4.2% |
| l | 3969 | 4.0% |
| g | 3806 | 3.8% |
| d | 3752 | 3.8% |
| 1 | 1103 | 1.1% |
| 2 | 1002 | 1.0% |
| a | 514 | 0.5% |
| A | 454 | 0.5% |
| 3 | 326 | 0.3% |
| Other values (43) | 2842 | 2.8% |
UnitNo
Categorical
| Distinct | 3335 |
|---|---|
| Distinct (%) | 4.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| 1 | 2762 |
|---|---|
| 2 | 2226 |
| 3 | 1941 |
| 4 | 1823 |
| Other values (3330) |
Length
| Max length | 39 |
|---|---|
| Median length | 1 |
| Mean length | 2.2277626 |
| Min length | 1 |
Characters and Unicode
| Total characters | 173839 |
|---|---|
| Distinct characters | 69 |
| Distinct categories | 10 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1153 ? |
|---|---|
| Unique (%) | 1.5% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| 24368 | ||
| 1 | 2762 | 3.5% |
| 2 | 2226 | 2.9% |
| 3 | 1941 | 2.5% |
| 4 | 1823 | 2.3% |
| 5 | 1597 | 2.0% |
| 6 | 1483 | 1.9% |
| 7 | 1286 | 1.6% |
| 8 | 1182 | 1.5% |
| 9 | 993 | 1.3% |
| Other values (3325) | 38372 |
Length
| Value | Count | Frequency (%) |
| 1 | 3473 | 5.5% |
| to | 2757 | 4.3% |
| 2 | 2690 | 4.2% |
| 3 | 2429 | 3.8% |
| 4 | 2286 | 3.6% |
| 5 | 2048 | 3.2% |
| 6 | 1838 | 2.9% |
| 7 | 1725 | 2.7% |
| 8 | 1597 | 2.5% |
| 1350 | 2.1% | |
| Other values (2124) | 41504 |
Most occurring characters
| Value | Count | Frequency (%) |
| 34555 | ||
| 1 | 28490 | |
| 2 | 18424 | |
| 0 | 18069 | |
| 3 | 10194 | 5.9% |
| 4 | 8347 | 4.8% |
| 5 | 7059 | 4.1% |
| 6 | 5947 | 3.4% |
| 7 | 5021 | 2.9% |
| 8 | 4667 | 2.7% |
| Other values (59) | 33066 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 109924 | |
| Space Separator | 34555 | 19.9% |
| Lowercase Letter | 12004 | 6.9% |
| Uppercase Letter | 10431 | 6.0% |
| Other Punctuation | 4953 | 2.8% |
| Dash Punctuation | 1812 | 1.0% |
| Open Punctuation | 70 | < 0.1% |
| Close Punctuation | 70 | < 0.1% |
| Math Symbol | 15 | < 0.1% |
| Control | 5 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2910 | |
| B | 2399 | |
| C | 992 | 9.5% |
| F | 772 | 7.4% |
| D | 557 | 5.3% |
| E | 547 | 5.2% |
| H | 362 | 3.5% |
| L | 333 | 3.2% |
| G | 324 | 3.1% |
| K | 170 | 1.6% |
| Other values (15) | 1065 | 10.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 3773 | |
| t | 3290 | |
| r | 765 | 6.4% |
| l | 743 | 6.2% |
| e | 695 | 5.8% |
| s | 410 | 3.4% |
| n | 397 | 3.3% |
| a | 329 | 2.7% |
| d | 261 | 2.2% |
| p | 236 | 2.0% |
| Other values (13) | 1105 | 9.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 28490 | |
| 2 | 18424 | |
| 0 | 18069 | |
| 3 | 10194 | 9.3% |
| 4 | 8347 | 7.6% |
| 5 | 7059 | 6.4% |
| 6 | 5947 | 5.4% |
| 7 | 5021 | 4.6% |
| 8 | 4667 | 4.2% |
| 9 | 3706 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 3862 | |
| , | 1058 | 21.4% |
| / | 20 | 0.4% |
| . | 12 | 0.2% |
| … | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 34555 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1812 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 70 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 70 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 15 |
Control
| Value | Count | Frequency (%) |
| 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 151404 | |
| Latin | 22435 | 12.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 3773 | |
| t | 3290 | |
| A | 2910 | |
| B | 2399 | |
| C | 992 | 4.4% |
| F | 772 | 3.4% |
| r | 765 | 3.4% |
| l | 743 | 3.3% |
| e | 695 | 3.1% |
| D | 557 | 2.5% |
| Other values (38) | 5539 |
Common
| Value | Count | Frequency (%) |
| 34555 | ||
| 1 | 28490 | |
| 2 | 18424 | |
| 0 | 18069 | |
| 3 | 10194 | 6.7% |
| 4 | 8347 | 5.5% |
| 5 | 7059 | 4.7% |
| 6 | 5947 | 3.9% |
| 7 | 5021 | 3.3% |
| 8 | 4667 | 3.1% |
| Other values (11) | 10631 | 7.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 173838 | |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 34555 | ||
| 1 | 28490 | |
| 2 | 18424 | |
| 0 | 18069 | |
| 3 | 10194 | 5.9% |
| 4 | 8347 | 4.8% |
| 5 | 7059 | 4.1% |
| 6 | 5947 | 3.4% |
| 7 | 5021 | 2.9% |
| 8 | 4667 | 2.7% |
| Other values (58) | 33065 |
Punctuation
| Value | Count | Frequency (%) |
| … | 1 |
PostalCode
Categorical
| Distinct | 2902 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| L5B 2C9 | 769 |
|---|---|
| L5M 4Z5 | 523 |
| L4T 2T9 | 477 |
| L5E 1V4 | 394 |
| L5P 1B2 | 386 |
| Other values (2897) |
Length
| Max length | 33 |
|---|---|
| Median length | 7 |
| Mean length | 6.9953481 |
| Min length | 1 |
Characters and Unicode
| Total characters | 545868 |
|---|---|
| Distinct characters | 48 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 139 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | L5T 2J3 |
|---|---|
| 2nd row | L5T 2J3 |
| 3rd row | L5T 2J3 |
| 4th row | L5T 2J3 |
| 5th row | L5T 2J3 |
Common Values
| Value | Count | Frequency (%) |
| L5B 2C9 | 769 | 1.0% |
| L5M 4Z5 | 523 | 0.7% |
| L4T 2T9 | 477 | 0.6% |
| L5E 1V4 | 394 | 0.5% |
| L5P 1B2 | 386 | 0.5% |
| L5C 1V8 | 332 | 0.4% |
| L5J 1K5 | 296 | 0.4% |
| L4W 5G6 | 284 | 0.4% |
| L4X 1L4 | 249 | 0.3% |
| L5B 1M7 | 247 | 0.3% |
| Other values (2892) | 74076 |
Length
| Value | Count | Frequency (%) |
| l4w | 12403 | 8.0% |
| l5t | 8317 | 5.3% |
| l5n | 6069 | 3.9% |
| l4z | 4948 | 3.2% |
| l5l | 4693 | 3.0% |
| l5b | 4589 | 2.9% |
| l5s | 4258 | 2.7% |
| l5m | 3801 | 2.4% |
| l4t | 3311 | 2.1% |
| l5a | 3290 | 2.1% |
| Other values (1078) | 100200 |
Most occurring characters
| Value | Count | Frequency (%) |
| L | 86506 | |
| 77968 | ||
| 5 | 63752 | |
| 4 | 47369 | 8.7% |
| 1 | 39205 | 7.2% |
| 2 | 25914 | 4.7% |
| 3 | 16424 | 3.0% |
| W | 16127 | 3.0% |
| T | 14622 | 2.7% |
| 6 | 11449 | 2.1% |
| Other values (38) | 146532 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 233941 | |
| Decimal Number | 233912 | |
| Space Separator | 77968 | 14.3% |
| Lowercase Letter | 33 | < 0.1% |
| Control | 14 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 86506 | |
| W | 16127 | 6.9% |
| T | 14622 | 6.3% |
| N | 9608 | 4.1% |
| A | 9326 | 4.0% |
| B | 8749 | 3.7% |
| Z | 8458 | 3.6% |
| M | 7909 | 3.4% |
| C | 7880 | 3.4% |
| V | 7750 | 3.3% |
| Other values (12) | 57006 |
Lowercase Letter
| Value | Count | Frequency (%) |
| k | 9 | |
| l | 5 | |
| c | 5 | |
| s | 3 | 9.1% |
| t | 2 | 6.1% |
| d | 2 | 6.1% |
| g | 1 | 3.0% |
| v | 1 | 3.0% |
| h | 1 | 3.0% |
| i | 1 | 3.0% |
| Other values (3) | 3 | 9.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 63752 | |
| 4 | 47369 | |
| 1 | 39205 | |
| 2 | 25914 | |
| 3 | 16424 | 7.0% |
| 6 | 11449 | 4.9% |
| 8 | 9658 | 4.1% |
| 9 | 8879 | 3.8% |
| 7 | 8525 | 3.6% |
| 0 | 2737 | 1.2% |
Control
| Value | Count | Frequency (%) |
| 8 | ||
| 6 |
Space Separator
| Value | Count | Frequency (%) |
| 77968 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 311894 | |
| Latin | 233974 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| L | 86506 | |
| W | 16127 | 6.9% |
| T | 14622 | 6.2% |
| N | 9608 | 4.1% |
| A | 9326 | 4.0% |
| B | 8749 | 3.7% |
| Z | 8458 | 3.6% |
| M | 7909 | 3.4% |
| C | 7880 | 3.4% |
| V | 7750 | 3.3% |
| Other values (25) | 57039 |
Common
| Value | Count | Frequency (%) |
| 77968 | ||
| 5 | 63752 | |
| 4 | 47369 | |
| 1 | 39205 | |
| 2 | 25914 | 8.3% |
| 3 | 16424 | 5.3% |
| 6 | 11449 | 3.7% |
| 8 | 9658 | 3.1% |
| 9 | 8879 | 2.8% |
| 7 | 8525 | 2.7% |
| Other values (3) | 2751 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 545868 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| L | 86506 | |
| 77968 | ||
| 5 | 63752 | |
| 4 | 47369 | 8.7% |
| 1 | 39205 | 7.2% |
| 2 | 25914 | 4.7% |
| 3 | 16424 | 3.0% |
| W | 16127 | 3.0% |
| T | 14622 | 2.7% |
| 6 | 11449 | 2.1% |
| Other values (38) | 146532 |
| Distinct | 56 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 47694 |
| Missing (%) | 61.1% |
| Memory size | 609.8 KiB |
| Northeast EA (West) | |
|---|---|
| Gateway EA (East) | |
| Dixie EA | |
| Meadowvale Business Park CC | |
| Western Business Park EA | |
| Other values (51) |
Length
| Max length | 27 |
|---|---|
| Median length | 23 |
| Mean length | 16.483866 |
| Min length | 7 |
Characters and Unicode
| Total characters | 500104 |
|---|---|
| Distinct characters | 43 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Gateway EA (East) |
|---|---|
| 2nd row | Gateway EA (East) |
| 3rd row | Gateway EA (East) |
| 4th row | Gateway EA (East) |
| 5th row | Gateway EA (East) |
Common Values
| Value | Count | Frequency (%) |
| Northeast EA (West) | 8087 | 10.4% |
| Gateway EA (East) | 1828 | 2.3% |
| Dixie EA | 1814 | 2.3% |
| Meadowvale Business Park CC | 1734 | 2.2% |
| Western Business Park EA | 1580 | 2.0% |
| DT Core | 1256 | 1.6% |
| DT Cooksville | 931 | 1.2% |
| Airport CC | 906 | 1.2% |
| Northeast EA (East) | 738 | 0.9% |
| Mavis-Erindale EA | 719 | 0.9% |
| Other values (46) | 10746 | 13.8% |
| (Missing) | 47694 |
Length
| Value | Count | Frequency (%) |
| ea | 15721 | |
| northeast | 8825 | 10.5% |
| west | 8730 | 10.4% |
| nhd | 5805 | 6.9% |
| park | 3715 | 4.4% |
| east | 3604 | 4.3% |
| business | 3314 | 3.9% |
| cc | 3101 | 3.7% |
| gateway | 2618 | 3.1% |
| dt | 2576 | 3.1% |
| Other values (45) | 25930 |
Most occurring characters
| Value | Count | Frequency (%) |
| 53600 | 10.7% | |
| e | 44801 | 9.0% |
| t | 42033 | 8.4% |
| s | 38109 | 7.6% |
| a | 32858 | 6.6% |
| r | 25884 | 5.2% |
| o | 23256 | 4.7% |
| E | 21305 | 4.3% |
| i | 18674 | 3.7% |
| A | 17879 | 3.6% |
| Other values (33) | 181705 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 300748 | |
| Uppercase Letter | 120982 | |
| Space Separator | 53600 | 10.7% |
| Open Punctuation | 11741 | 2.3% |
| Close Punctuation | 11741 | 2.3% |
| Dash Punctuation | 1292 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 44801 | |
| t | 42033 | |
| s | 38109 | |
| a | 32858 | |
| r | 25884 | |
| o | 23256 | |
| i | 18674 | |
| l | 13559 | 4.5% |
| n | 10785 | 3.6% |
| h | 10586 | 3.5% |
| Other values (11) | 40203 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 21305 | |
| A | 17879 | |
| N | 17487 | |
| C | 14057 | |
| W | 10310 | |
| D | 10195 | |
| H | 6417 | 5.3% |
| M | 5583 | 4.6% |
| P | 4710 | 3.9% |
| B | 3314 | 2.7% |
| Other values (8) | 9725 |
Space Separator
| Value | Count | Frequency (%) |
| 53600 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 11741 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 11741 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1292 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 421730 | |
| Common | 78374 | 15.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 44801 | 10.6% |
| t | 42033 | 10.0% |
| s | 38109 | 9.0% |
| a | 32858 | 7.8% |
| r | 25884 | 6.1% |
| o | 23256 | 5.5% |
| E | 21305 | 5.1% |
| i | 18674 | 4.4% |
| A | 17879 | 4.2% |
| N | 17487 | 4.1% |
| Other values (29) | 139444 |
Common
| Value | Count | Frequency (%) |
| 53600 | ||
| ( | 11741 | 15.0% |
| ) | 11741 | 15.0% |
| - | 1292 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 500104 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 53600 | 10.7% | |
| e | 44801 | 9.0% |
| t | 42033 | 8.4% |
| s | 38109 | 7.6% |
| a | 32858 | 6.6% |
| r | 25884 | 5.2% |
| o | 23256 | 4.7% |
| E | 21305 | 4.3% |
| i | 18674 | 3.7% |
| A | 17879 | 3.6% |
| Other values (33) | 181705 |
Ward
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.3925391 |
| Minimum | 1 |
|---|---|
| Maximum | 105 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 5 |
| median | 5 |
| Q3 | 7 |
| 95-th percentile | 11 |
| Maximum | 105 |
| Range | 104 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.5013405 |
|---|---|
| Coefficient of variation (CV) | 0.46385208 |
| Kurtosis | 32.117201 |
| Mean | 5.3925391 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.1404855 |
| Sum | 420796 |
| Variance | 6.2567041 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 33956 | |
| 1 | 6772 | 8.7% |
| 8 | 6086 | 7.8% |
| 7 | 5561 | 7.1% |
| 3 | 5005 | 6.4% |
| 9 | 4687 | 6.0% |
| 11 | 4300 | 5.5% |
| 4 | 4164 | 5.3% |
| 6 | 3584 | 4.6% |
| 2 | 3163 | 4.1% |
| Other values (2) | 755 | 1.0% |
| Value | Count | Frequency (%) |
| 1 | 6772 | 8.7% |
| 2 | 3163 | 4.1% |
| 3 | 5005 | 6.4% |
| 4 | 4164 | 5.3% |
| 5 | 33956 | |
| 6 | 3584 | 4.6% |
| 7 | 5561 | 7.1% |
| 8 | 6086 | 7.8% |
| 9 | 4687 | 6.0% |
| 10 | 754 | 1.0% |
| Value | Count | Frequency (%) |
| 105 | 1 | < 0.1% |
| 11 | 4300 | 5.5% |
| 10 | 754 | 1.0% |
| 9 | 4687 | 6.0% |
| 8 | 6086 | 7.8% |
| 7 | 5561 | 7.1% |
| 6 | 3584 | 4.6% |
| 5 | 33956 | |
| 4 | 4164 | 5.3% |
| 3 | 5005 | 6.4% |
NAICSCat
Categorical
| Distinct | 35 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 14 |
| Missing (%) | < 0.1% |
| Memory size | 609.8 KiB |
| Manufacturing | |
|---|---|
| Other Services | |
| Retail | |
| Wholesale | |
| Professional | |
| Other values (30) |
Length
| Max length | 50 |
|---|---|
| Median length | 39 |
| Mean length | 13.393122 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1044918 |
|---|---|
| Distinct characters | 37 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Wholesale |
|---|---|
| 2nd row | Manufacturing |
| 3rd row | Manufacturing |
| 4th row | Manufacturing |
| 5th row | Wholesale |
Common Values
| Value | Count | Frequency (%) |
| Manufacturing | 9646 | |
| Other Services | 9030 | |
| Retail | 8746 | |
| Wholesale | 6933 | 8.9% |
| Professional | 5654 | 7.2% |
| Health Care | 5123 | 6.6% |
| Accommodation | 4920 | 6.3% |
| Transportation | 3039 | 3.9% |
| Construction | 2778 | 3.6% |
| Educational | 2430 | 3.1% |
| Other values (25) | 19720 |
Length
| Value | Count | Frequency (%) |
| services | 12273 | 10.1% |
| retail | 11036 | 9.0% |
| manufacturing | 9646 | 7.9% |
| other | 9030 | 7.4% |
| wholesale | 8711 | 7.1% |
| and | 7448 | 6.1% |
| professional | 7076 | 5.8% |
| health | 6436 | 5.3% |
| care | 6436 | 5.3% |
| accommodation | 6130 | 5.0% |
| Other values (37) | 37833 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 109148 | 10.4% |
| e | 105364 | 10.1% |
| i | 79338 | 7.6% |
| n | 76816 | 7.4% |
| t | 76331 | 7.3% |
| r | 66338 | 6.3% |
| o | 64513 | 6.2% |
| s | 54367 | 5.2% |
| c | 52555 | 5.0% |
| l | 50682 | 4.9% |
| Other values (27) | 309466 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 882975 | |
| Uppercase Letter | 115239 | 11.0% |
| Space Separator | 44548 | 4.3% |
| Other Punctuation | 2156 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 109148 | |
| e | 105364 | |
| i | 79338 | |
| n | 76816 | |
| t | 76331 | |
| r | 66338 | |
| o | 64513 | |
| s | 54367 | 6.2% |
| c | 52555 | 6.0% |
| l | 50682 | 5.7% |
| Other values (10) | 147523 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 15530 | |
| R | 13949 | |
| A | 12271 | |
| M | 10669 | |
| W | 9969 | |
| C | 9433 | |
| T | 9265 | |
| O | 9030 | |
| P | 7557 | |
| H | 6436 | |
| Other values (5) | 11130 |
Space Separator
| Value | Count | Frequency (%) |
| 44548 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2156 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 998214 | |
| Common | 46704 | 4.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 109148 | |
| e | 105364 | |
| i | 79338 | 7.9% |
| n | 76816 | 7.7% |
| t | 76331 | 7.6% |
| r | 66338 | 6.6% |
| o | 64513 | 6.5% |
| s | 54367 | 5.4% |
| c | 52555 | 5.3% |
| l | 50682 | 5.1% |
| Other values (25) | 262762 |
Common
| Value | Count | Frequency (%) |
| 44548 | ||
| , | 2156 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1044918 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 109148 | 10.4% |
| e | 105364 | 10.1% |
| i | 79338 | 7.6% |
| n | 76816 | 7.4% |
| t | 76331 | 7.3% |
| r | 66338 | 6.3% |
| o | 64513 | 6.2% |
| s | 54367 | 5.2% |
| c | 52555 | 5.0% |
| l | 50682 | 4.9% |
| Other values (27) | 309466 |
NAICSDescr
Categorical
| Distinct | 1041 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| Limited-service eating places | 3646 |
|---|---|
| General Automotive Repair | 1991 |
| Full-service restaurants | 1777 |
| Offices of Dentists | 1603 |
| Offices of Physicians | 1502 |
| Other values (1036) |
Length
| Max length | 175 |
|---|---|
| Median length | 80 |
| Mean length | 35.408122 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2763002 |
|---|---|
| Distinct characters | 65 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 125 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Amusement and Sporting Goods Wholesaler-Distributors |
|---|---|
| 2nd row | Support Activities for Printing |
| 3rd row | Support Activities for Printing |
| 4th row | Other Printing |
| 5th row | Industrial Machinery, Equipment and Supplies Wholesaler-Distributors |
Common Values
| Value | Count | Frequency (%) |
| Limited-service eating places | 3646 | 4.7% |
| General Automotive Repair | 1991 | 2.6% |
| Full-service restaurants | 1777 | 2.3% |
| Offices of Dentists | 1603 | 2.1% |
| Offices of Physicians | 1502 | 1.9% |
| Offices of Lawyers | 1376 | 1.8% |
| Beauty Salons | 1302 | 1.7% |
| Other Freight Transportation Arrangement | 1253 | 1.6% |
| Elementary and Secondary Schools | 1240 | 1.6% |
| Religious Organizations | 1097 | 1.4% |
| Other values (1031) | 61246 |
Length
| Value | Count | Frequency (%) |
| and | 33328 | 10.0% |
| other | 18664 | 5.6% |
| stores | 9241 | 2.8% |
| offices | 8690 | 2.6% |
| of | 8401 | 2.5% |
| services | 8309 | 2.5% |
| all | 8269 | 2.5% |
| wholesaler-distributors | 7172 | 2.1% |
| manufacturing | 6726 | 2.0% |
| supplies | 4484 | 1.3% |
| Other values (1055) | 221527 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 278416 | 10.1% |
| 258010 | 9.3% | |
| i | 197847 | 7.2% |
| r | 189118 | 6.8% |
| n | 182948 | 6.6% |
| t | 181610 | 6.6% |
| a | 180904 | 6.5% |
| s | 160054 | 5.8% |
| o | 139237 | 5.0% |
| l | 115454 | 4.2% |
| Other values (55) | 879404 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2191707 | |
| Uppercase Letter | 275883 | 10.0% |
| Space Separator | 258451 | 9.4% |
| Dash Punctuation | 17701 | 0.6% |
| Other Punctuation | 11365 | 0.4% |
| Open Punctuation | 4146 | 0.2% |
| Close Punctuation | 3338 | 0.1% |
| Control | 405 | < 0.1% |
| Decimal Number | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 278416 | |
| i | 197847 | |
| r | 189118 | |
| n | 182948 | 8.3% |
| t | 181610 | 8.3% |
| a | 180904 | 8.3% |
| s | 160054 | 7.3% |
| o | 139237 | 6.4% |
| l | 115454 | 5.3% |
| c | 105554 | 4.8% |
| Other values (16) | 460565 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 38630 | |
| O | 30834 | |
| A | 24801 | 9.0% |
| C | 24420 | 8.9% |
| M | 21757 | 7.9% |
| P | 18971 | 6.9% |
| D | 14639 | 5.3% |
| W | 12579 | 4.6% |
| E | 11730 | 4.3% |
| F | 11257 | 4.1% |
| Other values (15) | 66265 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 9662 | |
| ' | 803 | 7.1% |
| & | 488 | 4.3% |
| . | 412 | 3.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 3 | 2 | |
| 8 | 1 | |
| 0 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 258010 | ||
| 441 | 0.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 17701 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4146 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3338 |
Control
| Value | Count | Frequency (%) |
| 405 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2467590 | |
| Common | 295412 | 10.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 278416 | 11.3% |
| i | 197847 | 8.0% |
| r | 189118 | 7.7% |
| n | 182948 | 7.4% |
| t | 181610 | 7.4% |
| a | 180904 | 7.3% |
| s | 160054 | 6.5% |
| o | 139237 | 5.6% |
| l | 115454 | 4.7% |
| c | 105554 | 4.3% |
| Other values (41) | 736448 |
Common
| Value | Count | Frequency (%) |
| 258010 | ||
| - | 17701 | 6.0% |
| , | 9662 | 3.3% |
| ( | 4146 | 1.4% |
| ) | 3338 | 1.1% |
| ' | 803 | 0.3% |
| & | 488 | 0.2% |
| 441 | 0.1% | |
| . | 412 | 0.1% |
| 405 | 0.1% | |
| Other values (4) | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2762561 | |
| None | 441 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 278416 | 10.1% |
| 258010 | 9.3% | |
| i | 197847 | 7.2% |
| r | 189118 | 6.8% |
| n | 182948 | 6.6% |
| t | 181610 | 6.6% |
| a | 180904 | 6.5% |
| s | 160054 | 5.8% |
| o | 139237 | 5.0% |
| l | 115454 | 4.2% |
| Other values (54) | 878963 |
None
| Value | Count | Frequency (%) |
| 441 |
Phone
Categorical
| Distinct | 25064 |
|---|---|
| Distinct (%) | 32.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| 1457 | |
| 905-615-3200 | 40 |
| 905-624-3811 | 35 |
| 000-000-0000 | 35 |
| 905-615-3777 | 24 |
| Other values (25059) |
Length
| Max length | 20 |
|---|---|
| Median length | 12 |
| Mean length | 11.666654 |
| Min length | 1 |
Characters and Unicode
| Total characters | 910384 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 7404 ? |
|---|---|
| Unique (%) | 9.5% |
Sample
| 1st row | 905-795-8900 |
|---|---|
| 2nd row | 905-795-9575 |
| 3rd row | 905-795-9519 |
| 4th row | 905-564-8121 |
| 5th row | 905-564-8080 |
Common Values
| Value | Count | Frequency (%) |
| 1457 | 1.9% | |
| 905-615-3200 | 40 | 0.1% |
| 905-624-3811 | 35 | < 0.1% |
| 000-000-0000 | 35 | < 0.1% |
| 905-615-3777 | 24 | < 0.1% |
| 905-677-9354 | 21 | < 0.1% |
| 905-670-4070 | 20 | < 0.1% |
| 905-615-4640 | 20 | < 0.1% |
| 905-615-4750 | 20 | < 0.1% |
| 905-615-4653 | 18 | < 0.1% |
| Other values (25054) | 76343 |
Length
| Value | Count | Frequency (%) |
| 905-615-3200 | 40 | 0.1% |
| 000-000-0000 | 35 | < 0.1% |
| 905-624-3811 | 35 | < 0.1% |
| 905-615-3777 | 24 | < 0.1% |
| 905-677-9354 | 21 | < 0.1% |
| 905-670-4070 | 20 | < 0.1% |
| 905-615-4640 | 20 | < 0.1% |
| 905-615-4750 | 20 | < 0.1% |
| 905-615-4653 | 18 | < 0.1% |
| 905-949-2222 | 17 | < 0.1% |
| Other values (25058) | 76340 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 143128 | |
| 0 | 136709 | |
| 5 | 117588 | |
| 9 | 114776 | |
| 2 | 71079 | |
| 6 | 70911 | |
| 7 | 60428 | |
| 8 | 60294 | |
| 1 | 49065 | 5.4% |
| 4 | 46596 | 5.1% |
| Other values (11) | 39810 | 4.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 765763 | |
| Dash Punctuation | 143132 | 15.7% |
| Space Separator | 1471 | 0.2% |
| Other Punctuation | 9 | < 0.1% |
| Lowercase Letter | 7 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 136709 | |
| 5 | 117588 | |
| 9 | 114776 | |
| 2 | 71079 | |
| 6 | 70911 | |
| 7 | 60428 | |
| 8 | 60294 | |
| 1 | 49065 | 6.4% |
| 4 | 46596 | 6.1% |
| 3 | 38317 | 5.0% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2 | |
| x | 2 | |
| t | 2 | |
| e | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 143128 | |
| – | 4 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6 | |
| ; | 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1 | |
| B | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1471 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 910375 | |
| Latin | 9 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 143128 | |
| 0 | 136709 | |
| 5 | 117588 | |
| 9 | 114776 | |
| 2 | 71079 | |
| 6 | 70911 | |
| 7 | 60428 | |
| 8 | 60294 | |
| 1 | 49065 | 5.4% |
| 4 | 46596 | 5.1% |
| Other values (5) | 39801 | 4.4% |
Latin
| Value | Count | Frequency (%) |
| o | 2 | |
| x | 2 | |
| t | 2 | |
| E | 1 | |
| e | 1 | |
| B | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 910380 | |
| Punctuation | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 143128 | |
| 0 | 136709 | |
| 5 | 117588 | |
| 9 | 114776 | |
| 2 | 71079 | |
| 6 | 70911 | |
| 7 | 60428 | |
| 8 | 60294 | |
| 1 | 49065 | 5.4% |
| 4 | 46596 | 5.1% |
| Other values (10) | 39806 | 4.4% |
Punctuation
| Value | Count | Frequency (%) |
| – | 4 |
Fax
Categorical
| Distinct | 15752 |
|---|---|
| Distinct (%) | 20.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| 905-822-2673 | 41 |
|---|---|
| 905-361-6401 | 37 |
| 905-896-9380 | 31 |
| 905-502-6982 | 18 |
| Other values (15747) |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 7.7663296 |
| Min length | 1 |
Characters and Unicode
| Total characters | 606030 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4752 ? |
|---|---|
| Unique (%) | 6.1% |
Sample
| 1st row | 905-795-8988 |
|---|---|
| 2nd row | 905-795-8775 |
| 3rd row | 905-795-8775 |
| 4th row | 905-564-7395 |
| 5th row | 905-564-5003 |
Common Values
| Value | Count | Frequency (%) |
| 29474 | ||
| 905-822-2673 | 41 | 0.1% |
| 905-361-6401 | 37 | < 0.1% |
| 905-896-9380 | 31 | < 0.1% |
| 905-502-6982 | 18 | < 0.1% |
| 905-625-4815 | 17 | < 0.1% |
| 905-542-0987 | 16 | < 0.1% |
| 905-607-9204 | 16 | < 0.1% |
| 905-625-8815 | 15 | < 0.1% |
| 905-403-8409 | 14 | < 0.1% |
| Other values (15742) | 48354 |
Length
| Value | Count | Frequency (%) |
| 905-822-2673 | 41 | 0.1% |
| 905-361-6401 | 37 | 0.1% |
| 905-896-9380 | 31 | 0.1% |
| 905-502-6982 | 18 | < 0.1% |
| 905-625-4815 | 17 | < 0.1% |
| 905-542-0987 | 16 | < 0.1% |
| 905-607-9204 | 16 | < 0.1% |
| 905-625-8815 | 15 | < 0.1% |
| 905-403-8409 | 14 | < 0.1% |
| 905-625-8245 | 13 | < 0.1% |
| Other values (15742) | 48342 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 90675 | |
| 0 | 79738 | |
| 5 | 78040 | |
| 9 | 75509 | |
| 6 | 47327 | |
| 2 | 44185 | |
| 8 | 39652 | |
| 7 | 37892 | |
| 1 | 30365 | 5.0% |
| 29475 | 4.9% | |
| Other values (2) | 53172 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 485880 | |
| Dash Punctuation | 90675 | 15.0% |
| Space Separator | 29475 | 4.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 79738 | |
| 5 | 78040 | |
| 9 | 75509 | |
| 6 | 47327 | |
| 2 | 44185 | |
| 8 | 39652 | |
| 7 | 37892 | |
| 1 | 30365 | 6.2% |
| 4 | 27785 | 5.7% |
| 3 | 25387 | 5.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 90675 |
Space Separator
| Value | Count | Frequency (%) |
| 29475 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 606030 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 90675 | |
| 0 | 79738 | |
| 5 | 78040 | |
| 9 | 75509 | |
| 6 | 47327 | |
| 2 | 44185 | |
| 8 | 39652 | |
| 7 | 37892 | |
| 1 | 30365 | 5.0% |
| 29475 | 4.9% | |
| Other values (2) | 53172 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 606030 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 90675 | |
| 0 | 79738 | |
| 5 | 78040 | |
| 9 | 75509 | |
| 6 | 47327 | |
| 2 | 44185 | |
| 8 | 39652 | |
| 7 | 37892 | |
| 1 | 30365 | 5.0% |
| 29475 | 4.9% | |
| Other values (2) | 53172 |
TollFree
Categorical
| Distinct | 4117 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| 1-800-769-2511 | 32 |
|---|---|
| 1-800-465-2422 | 32 |
| 1-800-472-6842 | 23 |
| 1-877-777-8672 | 16 |
| Other values (4112) |
Length
| Max length | 16 |
|---|---|
| Median length | 1 |
| Mean length | 2.8538695 |
| Min length | 1 |
Characters and Unicode
| Total characters | 222696 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1434 ? |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | 1-800-668-1101 |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| 66597 | ||
| 1-800-769-2511 | 32 | < 0.1% |
| 1-800-465-2422 | 32 | < 0.1% |
| 1-800-472-6842 | 23 | < 0.1% |
| 1-877-777-8672 | 16 | < 0.1% |
| 1-877-849-3637 | 16 | < 0.1% |
| 1-866-567-8888 | 13 | < 0.1% |
| 1-800-668-0414 | 10 | < 0.1% |
| 1-800-956-9543 | 10 | < 0.1% |
| 1-866-829-9433 | 10 | < 0.1% |
| Other values (4107) | 11274 | 14.4% |
Length
| Value | Count | Frequency (%) |
| 1-800-769-2511 | 32 | 0.3% |
| 1-800-465-2422 | 32 | 0.3% |
| 1-800-472-6842 | 23 | 0.2% |
| 1-877-777-8672 | 16 | 0.1% |
| 1-877-849-3637 | 16 | 0.1% |
| 1-866-567-8888 | 13 | 0.1% |
| 1-877-526-6639 | 10 | 0.1% |
| 1-800-254-0778 | 10 | 0.1% |
| 1-800-563-4327 | 10 | 0.1% |
| 1-866-829-9433 | 10 | 0.1% |
| Other values (4111) | 11269 |
Most occurring characters
| Value | Count | Frequency (%) |
| 66602 | ||
| - | 31297 | |
| 8 | 24221 | 10.9% |
| 1 | 16130 | 7.2% |
| 0 | 14466 | 6.5% |
| 6 | 14461 | 6.5% |
| 7 | 12782 | 5.7% |
| 5 | 9818 | 4.4% |
| 2 | 9799 | 4.4% |
| 3 | 8526 | 3.8% |
| Other values (5) | 14594 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 124793 | |
| Space Separator | 66602 | |
| Dash Punctuation | 31299 | 14.1% |
| Lowercase Letter | 1 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 24221 | |
| 1 | 16130 | |
| 0 | 14466 | |
| 6 | 14461 | |
| 7 | 12782 | |
| 5 | 9818 | |
| 2 | 9799 | |
| 3 | 8526 | 6.8% |
| 4 | 7930 | 6.4% |
| 9 | 6660 | 5.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 31297 | |
| – | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 66602 |
Lowercase Letter
| Value | Count | Frequency (%) |
| x | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 222695 | |
| Latin | 1 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 66602 | ||
| - | 31297 | |
| 8 | 24221 | 10.9% |
| 1 | 16130 | 7.2% |
| 0 | 14466 | 6.5% |
| 6 | 14461 | 6.5% |
| 7 | 12782 | 5.7% |
| 5 | 9818 | 4.4% |
| 2 | 9799 | 4.4% |
| 3 | 8526 | 3.8% |
| Other values (4) | 14593 | 6.6% |
Latin
| Value | Count | Frequency (%) |
| x | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 222694 | |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 66602 | ||
| - | 31297 | |
| 8 | 24221 | 10.9% |
| 1 | 16130 | 7.2% |
| 0 | 14466 | 6.5% |
| 6 | 14461 | 6.5% |
| 7 | 12782 | 5.7% |
| 5 | 9818 | 4.4% |
| 2 | 9799 | 4.4% |
| 3 | 8526 | 3.8% |
| Other values (4) | 14592 | 6.6% |
Punctuation
| Value | Count | Frequency (%) |
| – | 2 |
EMail
Categorical
| Distinct | 15058 |
|---|---|
| Distinct (%) | 19.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| info@publicstoragecanada.com | 21 |
|---|---|
| info@taxwide.com | 20 |
| info@ucmas.ca | 13 |
| info@mississaugaschoolofmusic.ca | 13 |
| Other values (15053) |
Length
| Max length | 97 |
|---|---|
| Median length | 55 |
| Mean length | 14.084964 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1099092 |
|---|---|
| Distinct characters | 78 |
| Distinct categories | 11 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 3 ? |
Unique
| Unique | 3361 ? |
|---|---|
| Unique (%) | 4.3% |
Sample
| 1st row | lfinch@golftrendsinc.com |
|---|---|
| 2nd row | prepress@apexgraphics.com |
| 3rd row | |
| 4th row | info@printmedia.ca |
| 5th row | shsieh@swrltd.com |
Common Values
| Value | Count | Frequency (%) |
| 30507 | ||
| info@publicstoragecanada.com | 21 | < 0.1% |
| info@taxwide.com | 20 | < 0.1% |
| info@ucmas.ca | 13 | < 0.1% |
| info@mississaugaschoolofmusic.ca | 13 | < 0.1% |
| cyclone@cyclonemfg.com | 12 | < 0.1% |
| millertrailers@rogers.com | 12 | < 0.1% |
| info@realfruitbubbletea.com | 12 | < 0.1% |
| info@akaloptical.com | 12 | < 0.1% |
| ktc.ca.info@kapsch.net | 12 | < 0.1% |
| Other values (15048) | 47399 |
Length
| Value | Count | Frequency (%) |
| info@publicstoragecanada.com | 21 | < 0.1% |
| info@taxwide.com | 20 | < 0.1% |
| info@ucmas.ca | 13 | < 0.1% |
| info@mississaugaschoolofmusic.ca | 13 | < 0.1% |
| cyclone@cyclonemfg.com | 12 | < 0.1% |
| millertrailers@rogers.com | 12 | < 0.1% |
| info@realfruitbubbletea.com | 12 | < 0.1% |
| info@akaloptical.com | 12 | < 0.1% |
| ktc.ca.info@kapsch.net | 12 | < 0.1% |
| insure@all-risks.com | 11 | < 0.1% |
| Other values (15012) | 47482 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 99086 | 9.0% |
| a | 97080 | 8.8% |
| c | 83214 | 7.6% |
| i | 74076 | 6.7% |
| e | 72811 | 6.6% |
| n | 63754 | 5.8% |
| m | 63062 | 5.7% |
| s | 58432 | 5.3% |
| r | 53466 | 4.9% |
| . | 51798 | 4.7% |
| Other values (68) | 382313 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 953466 | |
| Other Punctuation | 99332 | 9.0% |
| Space Separator | 30708 | 2.8% |
| Decimal Number | 11022 | 1.0% |
| Uppercase Letter | 1925 | 0.2% |
| Dash Punctuation | 1864 | 0.2% |
| Connector Punctuation | 766 | 0.1% |
| Control | 4 | < 0.1% |
| Modifier Symbol | 3 | < 0.1% |
| Final Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 99086 | |
| a | 97080 | |
| c | 83214 | 8.7% |
| i | 74076 | 7.8% |
| e | 72811 | 7.6% |
| n | 63754 | 6.7% |
| m | 63062 | 6.6% |
| s | 58432 | 6.1% |
| r | 53466 | 5.6% |
| t | 50375 | 5.3% |
| Other values (16) | 238110 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 281 | |
| S | 211 | 11.0% |
| M | 203 | 10.5% |
| C | 133 | 6.9% |
| A | 122 | 6.3% |
| D | 96 | 5.0% |
| P | 88 | 4.6% |
| B | 81 | 4.2% |
| J | 79 | 4.1% |
| T | 77 | 4.0% |
| Other values (16) | 554 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1932 | |
| 0 | 1824 | |
| 2 | 1678 | |
| 3 | 975 | |
| 5 | 873 | |
| 4 | 804 | |
| 7 | 764 | 6.9% |
| 6 | 755 | 6.8% |
| 8 | 753 | 6.8% |
| 9 | 664 | 6.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 51798 | |
| @ | 47451 | |
| / | 35 | < 0.1% |
| & | 18 | < 0.1% |
| , | 8 | < 0.1% |
| ' | 7 | < 0.1% |
| # | 5 | < 0.1% |
| : | 5 | < 0.1% |
| · | 5 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 30708 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1864 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 766 |
Control
| Value | Count | Frequency (%) |
| 4 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 3 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 1 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 955391 | |
| Common | 143701 | 13.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 99086 | |
| a | 97080 | |
| c | 83214 | 8.7% |
| i | 74076 | 7.8% |
| e | 72811 | 7.6% |
| n | 63754 | 6.7% |
| m | 63062 | 6.6% |
| s | 58432 | 6.1% |
| r | 53466 | 5.6% |
| t | 50375 | 5.3% |
| Other values (42) | 240035 |
Common
| Value | Count | Frequency (%) |
| . | 51798 | |
| @ | 47451 | |
| 30708 | ||
| 1 | 1932 | 1.3% |
| - | 1864 | 1.3% |
| 0 | 1824 | 1.3% |
| 2 | 1678 | 1.2% |
| 3 | 975 | 0.7% |
| 5 | 873 | 0.6% |
| 4 | 804 | 0.6% |
| Other values (16) | 3794 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1099086 | |
| None | 5 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 99086 | 9.0% |
| a | 97080 | 8.8% |
| c | 83214 | 7.6% |
| i | 74076 | 6.7% |
| e | 72811 | 6.6% |
| n | 63754 | 5.8% |
| m | 63062 | 5.7% |
| s | 58432 | 5.3% |
| r | 53466 | 4.9% |
| . | 51798 | 4.7% |
| Other values (66) | 382307 |
None
| Value | Count | Frequency (%) |
| · | 5 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 1 |
WebAddress
Categorical
| Distinct | 14200 |
|---|---|
| Distinct (%) | 18.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| www.dpcdsb.org | 221 |
|---|---|
| www.subway.com | 215 |
| www.timhortons.com | 211 |
| www.petro-canada.ca | 115 |
| Other values (14195) |
Length
| Max length | 84 |
|---|---|
| Median length | 50 |
| Mean length | 14.52579 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1133491 |
|---|---|
| Distinct characters | 80 |
| Distinct categories | 12 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 2033 ? |
|---|---|
| Unique (%) | 2.6% |
Sample
| 1st row | www.golftrendsinc.com |
|---|---|
| 2nd row | www.apexgraphics.com |
| 3rd row | |
| 4th row | www.printmedia.ca |
| 5th row | www.swrltd.com |
Common Values
| Value | Count | Frequency (%) |
| 21267 | 27.3% | |
| www.dpcdsb.org | 221 | 0.3% |
| www.subway.com | 215 | 0.3% |
| www.timhortons.com | 211 | 0.3% |
| www.petro-canada.ca | 115 | 0.1% |
| www.shoppersdrugmart.ca | 107 | 0.1% |
| www.mississauga.ca/portal/residents/fire | 95 | 0.1% |
| www.td.com | 91 | 0.1% |
| www.dollarama.com | 88 | 0.1% |
| www.shell.ca | 84 | 0.1% |
| Other values (14190) | 55539 |
Length
| Value | Count | Frequency (%) |
| www.dpcdsb.org | 221 | 0.4% |
| www.subway.com | 215 | 0.4% |
| www.timhortons.com | 211 | 0.4% |
| www.petro-canada.ca | 115 | 0.2% |
| www.shoppersdrugmart.ca | 107 | 0.2% |
| www.mississauga.ca/portal/residents/fire | 95 | 0.2% |
| www.td.com | 91 | 0.2% |
| www.dollarama.com | 88 | 0.2% |
| www.shell.ca | 84 | 0.1% |
| www.starbucks.ca | 83 | 0.1% |
| Other values (14093) | 55517 |
Most occurring characters
| Value | Count | Frequency (%) |
| w | 178473 | |
| . | 114798 | 10.1% |
| c | 90001 | 7.9% |
| a | 87304 | 7.7% |
| o | 81313 | 7.2% |
| e | 65392 | 5.8% |
| m | 55956 | 4.9% |
| s | 50675 | 4.5% |
| i | 50384 | 4.4% |
| r | 49833 | 4.4% |
| Other values (70) | 309362 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 989750 | |
| Other Punctuation | 116176 | 10.2% |
| Space Separator | 21324 | 1.9% |
| Dash Punctuation | 2684 | 0.2% |
| Decimal Number | 2467 | 0.2% |
| Uppercase Letter | 1007 | 0.1% |
| Math Symbol | 52 | < 0.1% |
| Control | 10 | < 0.1% |
| Connector Punctuation | 10 | < 0.1% |
| Modifier Symbol | 8 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| w | 178473 | |
| c | 90001 | 9.1% |
| a | 87304 | 8.8% |
| o | 81313 | 8.2% |
| e | 65392 | 6.6% |
| m | 55956 | 5.7% |
| s | 50675 | 5.1% |
| i | 50384 | 5.1% |
| r | 49833 | 5.0% |
| t | 47223 | 4.8% |
| Other values (17) | 233196 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 108 | 10.7% |
| W | 105 | 10.4% |
| S | 71 | 7.1% |
| M | 70 | 7.0% |
| T | 59 | 5.9% |
| A | 57 | 5.7% |
| L | 57 | 5.7% |
| F | 52 | 5.2% |
| R | 51 | 5.1% |
| P | 41 | 4.1% |
| Other values (16) | 336 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 551 | |
| 2 | 475 | |
| 0 | 349 | |
| 4 | 324 | |
| 3 | 230 | |
| 6 | 129 | 5.2% |
| 8 | 119 | 4.8% |
| 9 | 119 | 4.8% |
| 5 | 101 | 4.1% |
| 7 | 70 | 2.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 114798 | |
| / | 1297 | 1.1% |
| @ | 47 | < 0.1% |
| & | 18 | < 0.1% |
| \ | 6 | < 0.1% |
| , | 4 | < 0.1% |
| : | 3 | < 0.1% |
| ' | 2 | < 0.1% |
| · | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 21324 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2684 |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 52 |
Control
| Value | Count | Frequency (%) |
| 10 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 10 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 8 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 990757 | |
| Common | 142734 | 12.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| w | 178473 | |
| c | 90001 | 9.1% |
| a | 87304 | 8.8% |
| o | 81313 | 8.2% |
| e | 65392 | 6.6% |
| m | 55956 | 5.6% |
| s | 50675 | 5.1% |
| i | 50384 | 5.1% |
| r | 49833 | 5.0% |
| t | 47223 | 4.8% |
| Other values (43) | 234203 |
Common
| Value | Count | Frequency (%) |
| . | 114798 | |
| 21324 | 14.9% | |
| - | 2684 | 1.9% |
| / | 1297 | 0.9% |
| 1 | 551 | 0.4% |
| 2 | 475 | 0.3% |
| 0 | 349 | 0.2% |
| 4 | 324 | 0.2% |
| 3 | 230 | 0.2% |
| 6 | 129 | 0.1% |
| Other values (17) | 573 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1133487 | |
| None | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| w | 178473 | |
| . | 114798 | 10.1% |
| c | 90001 | 7.9% |
| a | 87304 | 7.7% |
| o | 81313 | 7.2% |
| e | 65392 | 5.8% |
| m | 55956 | 4.9% |
| s | 50675 | 4.5% |
| i | 50384 | 4.4% |
| r | 49833 | 4.4% |
| Other values (68) | 309358 |
None
| Value | Count | Frequency (%) |
| é | 3 | |
| · | 1 | 25.0% |
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2646 |
| Missing (%) | 3.4% |
| Memory size | 609.8 KiB |
| 1 to 4 | |
|---|---|
| 5 to 9 | |
| 10 to 19 | |
| 1 - 4 | |
| 20 to 49 | |
| Other values (14) |
Length
| Max length | 10 |
|---|---|
| Median length | 6 |
| Mean length | 6.4964384 |
| Min length | 5 |
Characters and Unicode
| Total characters | 489747 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 10 to 19 |
|---|---|
| 2nd row | 20 to 49 |
| 3rd row | 50 to 99 |
| 4th row | 1 to 4 |
| 5th row | 5 to 9 |
Common Values
| Value | Count | Frequency (%) |
| 1 to 4 | 28587 | |
| 5 to 9 | 12508 | |
| 10 to 19 | 8204 | 10.5% |
| 1 - 4 | 7498 | 9.6% |
| 20 to 49 | 6290 | 8.1% |
| 5 - 9 | 3032 | 3.9% |
| 50 to 99 | 2582 | 3.3% |
| 10 - 19 | 1967 | 2.5% |
| 100 to 299 | 1640 | 2.1% |
| 20 - 49 | 1527 | 2.0% |
| Other values (9) | 1552 | 2.0% |
| (Missing) | 2646 | 3.4% |
Length
| Value | Count | Frequency (%) |
| to | 60184 | |
| 1 | 36085 | |
| 4 | 36085 | |
| 5 | 15540 | 6.9% |
| 9 | 15540 | 6.9% |
| 15116 | 6.7% | |
| 10 | 10171 | 4.5% |
| 19 | 10171 | 4.5% |
| 20 | 7817 | 3.5% |
| 49 | 7817 | 3.5% |
| Other values (11) | 11533 | 5.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 150672 | ||
| t | 60184 | 12.3% |
| o | 60184 | 12.3% |
| 1 | 58552 | 12.0% |
| 9 | 45056 | 9.2% |
| 4 | 44205 | 9.0% |
| 0 | 26431 | 5.4% |
| 5 | 18886 | 3.9% |
| - | 15116 | 3.1% |
| 2 | 9855 | 2.0% |
| Other values (6) | 606 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 203288 | |
| Space Separator | 150672 | |
| Lowercase Letter | 120656 | |
| Dash Punctuation | 15116 | 3.1% |
| Math Symbol | 15 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 58552 | |
| 9 | 45056 | |
| 4 | 44205 | |
| 0 | 26431 | |
| 5 | 18886 | 9.3% |
| 2 | 9855 | 4.8% |
| 3 | 303 | 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 60184 | |
| o | 60184 | |
| p | 72 | 0.1% |
| l | 72 | 0.1% |
| u | 72 | 0.1% |
| s | 72 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 150672 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15116 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 15 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 369091 | |
| Latin | 120656 | 24.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 150672 | ||
| 1 | 58552 | 15.9% |
| 9 | 45056 | 12.2% |
| 4 | 44205 | 12.0% |
| 0 | 26431 | 7.2% |
| 5 | 18886 | 5.1% |
| - | 15116 | 4.1% |
| 2 | 9855 | 2.7% |
| 3 | 303 | 0.1% |
| + | 15 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| t | 60184 | |
| o | 60184 | |
| p | 72 | 0.1% |
| l | 72 | 0.1% |
| u | 72 | 0.1% |
| s | 72 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 489747 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 150672 | ||
| t | 60184 | 12.3% |
| o | 60184 | 12.3% |
| 1 | 58552 | 12.0% |
| 9 | 45056 | 9.2% |
| 4 | 44205 | 9.0% |
| 0 | 26431 | 5.4% |
| 5 | 18886 | 3.9% |
| - | 15116 | 3.1% |
| 2 | 9855 | 2.0% |
| Other values (6) | 606 | 0.1% |
| Distinct | 433 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 15002 |
| Missing (%) | 19.2% |
| Memory size | 609.8 KiB |
| 2017/11/08 00:00:00+00 | |
|---|---|
| 2018/12/30 00:00:00+00 | |
| 2017/11/09 00:00:00+00 | |
| 2015/10/31 00:00:00+00 | |
| 2016/10/31 00:00:00+00 | |
| Other values (428) |
Length
| Max length | 22 |
|---|---|
| Median length | 22 |
| Mean length | 22 |
| Min length | 22 |
Characters and Unicode
| Total characters | 1386682 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 111 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 2015/10/31 00:00:00+00 |
|---|---|
| 2nd row | 2016/10/31 00:00:00+00 |
| 3rd row | 2015/10/31 00:00:00+00 |
| 4th row | 2015/10/31 00:00:00+00 |
| 5th row | 2015/10/31 00:00:00+00 |
Common Values
| Value | Count | Frequency (%) |
| 2017/11/08 00:00:00+00 | 11038 | |
| 2018/12/30 00:00:00+00 | 9918 | |
| 2017/11/09 00:00:00+00 | 8042 | |
| 2015/10/31 00:00:00+00 | 4560 | 5.8% |
| 2016/10/31 00:00:00+00 | 4499 | 5.8% |
| 2019/12/12 00:00:00+00 | 3326 | 4.3% |
| 2019/09/19 00:00:00+00 | 2718 | 3.5% |
| 2018/09/30 00:00:00+00 | 849 | 1.1% |
| 2017/06/08 00:00:00+00 | 726 | 0.9% |
| 2017/05/24 00:00:00+00 | 646 | 0.8% |
| Other values (423) | 16709 | |
| (Missing) | 15002 |
Length
| Value | Count | Frequency (%) |
| 00:00:00+00 | 63031 | |
| 2017/11/08 | 11038 | 8.8% |
| 2018/12/30 | 9918 | 7.9% |
| 2017/11/09 | 8042 | 6.4% |
| 2015/10/31 | 4560 | 3.6% |
| 2016/10/31 | 4499 | 3.6% |
| 2019/12/12 | 3326 | 2.6% |
| 2019/09/19 | 2718 | 2.2% |
| 2018/09/30 | 849 | 0.7% |
| 2017/06/08 | 726 | 0.6% |
| Other values (424) | 17355 | 13.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 635263 | |
| 1 | 147280 | 10.6% |
| / | 126062 | 9.1% |
| : | 126062 | 9.1% |
| 2 | 85562 | 6.2% |
| 63031 | 4.5% | |
| + | 63031 | 4.5% |
| 7 | 33727 | 2.4% |
| 8 | 28229 | 2.0% |
| 9 | 23888 | 1.7% |
| Other values (4) | 54547 | 3.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1008496 | |
| Other Punctuation | 252124 | 18.2% |
| Space Separator | 63031 | 4.5% |
| Math Symbol | 63031 | 4.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 635263 | |
| 1 | 147280 | 14.6% |
| 2 | 85562 | 8.5% |
| 7 | 33727 | 3.3% |
| 8 | 28229 | 2.8% |
| 9 | 23888 | 2.4% |
| 3 | 22832 | 2.3% |
| 5 | 16305 | 1.6% |
| 6 | 13078 | 1.3% |
| 4 | 2332 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 126062 | |
| : | 126062 |
Space Separator
| Value | Count | Frequency (%) |
| 63031 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 63031 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1386682 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 635263 | |
| 1 | 147280 | 10.6% |
| / | 126062 | 9.1% |
| : | 126062 | 9.1% |
| 2 | 85562 | 6.2% |
| 63031 | 4.5% | |
| + | 63031 | 4.5% |
| 7 | 33727 | 2.4% |
| 8 | 28229 | 2.0% |
| 9 | 23888 | 1.7% |
| Other values (4) | 54547 | 3.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1386682 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 635263 | |
| 1 | 147280 | 10.6% |
| / | 126062 | 9.1% |
| : | 126062 | 9.1% |
| 2 | 85562 | 6.2% |
| 63031 | 4.5% | |
| + | 63031 | 4.5% |
| 7 | 33727 | 2.4% |
| 8 | 28229 | 2.0% |
| 9 | 23888 | 1.7% |
| Other values (4) | 54547 | 3.9% |
| Distinct | 29 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 63431 |
| Missing (%) | 81.3% |
| Memory size | 609.8 KiB |
| Financial Services | 870 |
|---|---|
| Food and Beverage | 444 |
| Automotive | 329 |
| Life Sciences | 263 |
| Other values (24) | 313 |
Length
| Max length | 57 |
|---|---|
| Median length | 1 |
| Mean length | 3.3009177 |
| Min length | 1 |
Characters and Unicode
| Total characters | 48200 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| 12383 | 15.9% | |
| Financial Services | 870 | 1.1% |
| Food and Beverage | 444 | 0.6% |
| Automotive | 329 | 0.4% |
| Life Sciences | 263 | 0.3% |
| Aerospace | 132 | 0.2% |
| Automotive,Aerospace | 55 | 0.1% |
| Cleantech | 24 | < 0.1% |
| Automotive,Food and Beverage | 24 | < 0.1% |
| Automotive,Aerospace,Food and Beverage | 15 | < 0.1% |
| Other values (19) | 63 | 0.1% |
| (Missing) | 63431 |
Length
| Value | Count | Frequency (%) |
| services | 884 | |
| financial | 870 | |
| and | 528 | |
| beverage | 514 | |
| food | 452 | |
| automotive | 329 | 7.4% |
| life | 281 | 6.3% |
| sciences | 265 | 5.9% |
| aerospace | 132 | 3.0% |
| automotive,aerospace | 55 | 1.2% |
| Other values (15) | 145 | 3.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 14623 | ||
| e | 5221 | 10.8% |
| i | 3691 | 7.7% |
| a | 3091 | 6.4% |
| c | 2627 | 5.5% |
| n | 2626 | 5.4% |
| o | 2183 | 4.5% |
| v | 1859 | 3.9% |
| r | 1645 | 3.4% |
| s | 1413 | 2.9% |
| Other values (16) | 9221 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 29243 | |
| Space Separator | 14623 | |
| Uppercase Letter | 4130 | 8.6% |
| Other Punctuation | 204 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5221 | |
| i | 3691 | |
| a | 3091 | |
| c | 2627 | |
| n | 2626 | |
| o | 2183 | |
| v | 1859 | 6.4% |
| r | 1645 | 5.6% |
| s | 1413 | 4.8% |
| d | 1056 | 3.6% |
| Other values (8) | 3831 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 1412 | |
| S | 1180 | |
| A | 680 | |
| B | 528 | 12.8% |
| L | 296 | 7.2% |
| C | 34 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| 14623 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 204 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 33373 | |
| Common | 14827 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5221 | |
| i | 3691 | |
| a | 3091 | |
| c | 2627 | 7.9% |
| n | 2626 | 7.9% |
| o | 2183 | 6.5% |
| v | 1859 | 5.6% |
| r | 1645 | 4.9% |
| s | 1413 | 4.2% |
| F | 1412 | 4.2% |
| Other values (14) | 7605 |
Common
| Value | Count | Frequency (%) |
| 14623 | ||
| , | 204 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48200 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 14623 | ||
| e | 5221 | 10.8% |
| i | 3691 | 7.7% |
| a | 3091 | 6.4% |
| c | 2627 | 5.5% |
| n | 2626 | 5.4% |
| o | 2183 | 4.5% |
| v | 1859 | 3.9% |
| r | 1645 | 3.4% |
| s | 1413 | 2.9% |
| Other values (16) | 9221 |
| Distinct | 4685 |
|---|---|
| Distinct (%) | 15.4% |
| Missing | 47694 |
| Missing (%) | 61.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 608659.35 |
| Minimum | 596627.93 |
|---|---|
| Maximum | 616985.06 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | 596627.93 |
|---|---|
| 5-th percentile | 601465.65 |
| Q1 | 606483.02 |
| median | 608923.98 |
| Q3 | 611391.08 |
| 95-th percentile | 614814.86 |
| Maximum | 616985.06 |
| Range | 20357.121 |
| Interquartile range (IQR) | 4908.0572 |
Descriptive statistics
| Standard deviation | 3852.0245 |
|---|---|
| Coefficient of variation (CV) | 0.0063287033 |
| Kurtosis | -0.066028416 |
| Mean | 608659.35 |
| Median Absolute Deviation (MAD) | 2462.861 |
| Skewness | -0.41317914 |
| Sum | 1.8466116 × 1010 |
| Variance | 14838093 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 609556.5032 | 367 | 0.5% |
| 612552.1674 | 255 | 0.3% |
| 604009.418 | 228 | 0.3% |
| 609657.7584 | 205 | 0.3% |
| 615480.8966 | 178 | 0.2% |
| 604848.575 | 110 | 0.1% |
| 608539.0792 | 107 | 0.1% |
| 612581.1624 | 106 | 0.1% |
| 608826.735 | 100 | 0.1% |
| 600161.54 | 100 | 0.1% |
| Other values (4675) | 28583 | |
| (Missing) | 47694 |
| Value | Count | Frequency (%) |
| 596627.9342 | 2 | < 0.1% |
| 596752.9696 | 2 | < 0.1% |
| 597309.0542 | 3 | < 0.1% |
| 597312.632 | 2 | < 0.1% |
| 597772.3526 | 49 | |
| 597782.4012 | 2 | < 0.1% |
| 597812.404 | 2 | < 0.1% |
| 597933.2448 | 13 | < 0.1% |
| 597963.9396 | 25 | |
| 598104.1884 | 24 |
| Value | Count | Frequency (%) |
| 616985.0552 | 9 | |
| 616917.8604 | 1 | < 0.1% |
| 616879.86 | 1 | < 0.1% |
| 616836.9092 | 2 | < 0.1% |
| 616794.193 | 2 | < 0.1% |
| 616756.05 | 2 | < 0.1% |
| 616706.7026 | 2 | < 0.1% |
| 616695.363 | 4 | |
| 616668.1574 | 2 | < 0.1% |
| 616652.9546 | 1 | < 0.1% |
| Distinct | 4686 |
|---|---|
| Distinct (%) | 15.4% |
| Missing | 47694 |
| Missing (%) | 61.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4829613.5 |
| Minimum | 4815546.6 |
|---|---|
| Maximum | 4843107.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | 4815546.6 |
|---|---|
| 5-th percentile | 4819703.7 |
| Q1 | 4825956.9 |
| median | 4829277.7 |
| Q3 | 4833786.4 |
| 95-th percentile | 4839313.8 |
| Maximum | 4843107.8 |
| Range | 27561.198 |
| Interquartile range (IQR) | 7829.5472 |
Descriptive statistics
| Standard deviation | 5660.9074 |
|---|---|
| Coefficient of variation (CV) | 0.0011721243 |
| Kurtosis | -0.58959864 |
| Mean | 4829613.5 |
| Median Absolute Deviation (MAD) | 3923.2536 |
| Skewness | -0.0065033237 |
| Sum | 1.4652564 × 1011 |
| Variance | 32045872 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4827620.949 | 367 | 0.5% |
| 4837278.362 | 255 | 0.3% |
| 4823628.592 | 228 | 0.3% |
| 4841687.188 | 205 | 0.3% |
| 4827728.859 | 178 | 0.2% |
| 4824071.126 | 110 | 0.1% |
| 4840485.574 | 107 | 0.1% |
| 4831178.774 | 106 | 0.1% |
| 4823713.954 | 100 | 0.1% |
| 4826202.792 | 100 | 0.1% |
| Other values (4676) | 28583 | |
| (Missing) | 47694 |
| Value | Count | Frequency (%) |
| 4815546.641 | 1 | < 0.1% |
| 4815609.051 | 2 | |
| 4816109.607 | 2 | |
| 4816333.508 | 2 | |
| 4816381.801 | 4 | |
| 4816389.354 | 2 | |
| 4816462.515 | 1 | < 0.1% |
| 4816663.969 | 2 | |
| 4816718.415 | 2 | |
| 4816760.675 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 4843107.84 | 19 | |
| 4843040.829 | 2 | < 0.1% |
| 4842998.68 | 2 | < 0.1% |
| 4842855.077 | 2 | < 0.1% |
| 4842717.945 | 2 | < 0.1% |
| 4842534.357 | 2 | < 0.1% |
| 4842303.169 | 5 | < 0.1% |
| 4842272.626 | 2 | < 0.1% |
| 4842238.75 | 2 | < 0.1% |
| 4842206.186 | 4 | < 0.1% |
Year
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| 2019 | |
|---|---|
| 2018 | |
| 2017 | |
| 2021 | |
| 2016 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 312132 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2016 |
|---|---|
| 2nd row | 2016 |
| 3rd row | 2016 |
| 4th row | 2016 |
| 5th row | 2016 |
Common Values
| Value | Count | Frequency (%) |
| 2019 | 16518 | |
| 2018 | 16351 | |
| 2017 | 15737 | |
| 2021 | 14825 | |
| 2016 | 14602 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2019 | 16518 | |
| 2018 | 16351 | |
| 2017 | 15737 | |
| 2021 | 14825 | |
| 2016 | 14602 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 92858 | |
| 0 | 78033 | |
| 1 | 78033 | |
| 9 | 16518 | 5.3% |
| 8 | 16351 | 5.2% |
| 7 | 15737 | 5.0% |
| 6 | 14602 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 312132 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 92858 | |
| 0 | 78033 | |
| 1 | 78033 | |
| 9 | 16518 | 5.3% |
| 8 | 16351 | 5.2% |
| 7 | 15737 | 5.0% |
| 6 | 14602 | 4.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 312132 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 92858 | |
| 0 | 78033 | |
| 1 | 78033 | |
| 9 | 16518 | 5.3% |
| 8 | 16351 | 5.2% |
| 7 | 15737 | 5.0% |
| 6 | 14602 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 312132 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 92858 | |
| 0 | 78033 | |
| 1 | 78033 | |
| 9 | 16518 | 5.3% |
| 8 | 16351 | 5.2% |
| 7 | 15737 | 5.0% |
| 6 | 14602 | 4.7% |
| Distinct | 4961 |
|---|---|
| Distinct (%) | 10.4% |
| Missing | 30339 |
| Missing (%) | 38.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11122766 |
| Minimum | 32500 |
|---|---|
| Maximum | 32656400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | 32500 |
|---|---|
| 5-th percentile | 1878100 |
| Q1 | 5158600 |
| median | 10172700 |
| Q3 | 14774550 |
| 95-th percentile | 28577700 |
| Maximum | 32656400 |
| Range | 32623900 |
| Interquartile range (IQR) | 9615950 |
Descriptive statistics
| Standard deviation | 7579323.6 |
|---|---|
| Coefficient of variation (CV) | 0.68142438 |
| Kurtosis | 0.64247043 |
| Mean | 11122766 |
| Median Absolute Deviation (MAD) | 4630200 |
| Skewness | 1.0446214 |
| Sum | 5.3048918 × 1011 |
| Variance | 5.7446147 × 1013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6068300 | 587 | 0.8% |
| 31141506 | 414 | 0.5% |
| 4407700 | 328 | 0.4% |
| 9663800 | 287 | 0.4% |
| 12876900 | 216 | 0.3% |
| 24265600 | 190 | 0.2% |
| 14804200 | 186 | 0.2% |
| 31381800 | 177 | 0.2% |
| 17704200 | 161 | 0.2% |
| 10173700 | 147 | 0.2% |
| Other values (4951) | 45001 | |
| (Missing) | 30339 |
| Value | Count | Frequency (%) |
| 32500 | 3 | < 0.1% |
| 37200 | 10 | < 0.1% |
| 37300 | 2 | < 0.1% |
| 37400 | 33 | |
| 38100 | 2 | < 0.1% |
| 38300 | 9 | < 0.1% |
| 38400 | 14 | |
| 38500 | 2 | < 0.1% |
| 38600 | 13 | < 0.1% |
| 38700 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 32656400 | 1 | < 0.1% |
| 32646400 | 44 | |
| 32551400 | 1 | < 0.1% |
| 32526400 | 2 | < 0.1% |
| 32476400 | 11 | < 0.1% |
| 32442000 | 5 | < 0.1% |
| 32441600 | 2 | < 0.1% |
| 32436400 | 25 | |
| 32431500 | 43 | |
| 32371800 | 1 | < 0.1% |
| Distinct | 56 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 61682 |
| Missing (%) | 79.0% |
| Memory size | 609.8 KiB |
| Northeast EA (West) | |
|---|---|
| Dixie EA | |
| Gateway EA (East) | |
| Meadowvale Business Park CC | |
| Western Business Park EA | |
| Other values (51) |
Length
| Max length | 27 |
|---|---|
| Median length | 23 |
| Mean length | 16.545777 |
| Min length | 7 |
Characters and Unicode
| Total characters | 270540 |
|---|---|
| Distinct characters | 43 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Cooksville NHD (East) |
|---|---|
| 2nd row | Rathwood NHD |
| 3rd row | Cooksville NHD (East) |
| 4th row | Rathwood-Applewood CN |
| 5th row | Cooksville NHD (East) |
Common Values
| Value | Count | Frequency (%) |
| Northeast EA (West) | 4700 | 6.0% |
| Dixie EA | 1048 | 1.3% |
| Gateway EA (East) | 1034 | 1.3% |
| Meadowvale Business Park CC | 998 | 1.3% |
| Western Business Park EA | 847 | 1.1% |
| DT Core | 739 | 0.9% |
| Airport CC | 507 | 0.6% |
| Northeast EA (East) | 411 | 0.5% |
| DT Cooksville | 409 | 0.5% |
| Mavis-Erindale EA | 392 | 0.5% |
| Other values (46) | 5266 | 6.7% |
| (Missing) | 61682 |
Length
| Value | Count | Frequency (%) |
| ea | 8946 | |
| northeast | 5111 | 11.3% |
| west | 5028 | 11.1% |
| nhd | 2823 | 6.2% |
| park | 2036 | 4.5% |
| east | 1943 | 4.3% |
| business | 1845 | 4.1% |
| cc | 1768 | 3.9% |
| gateway | 1473 | 3.2% |
| dt | 1330 | 2.9% |
| Other values (45) | 13072 |
Most occurring characters
| Value | Count | Frequency (%) |
| 29024 | 10.7% | |
| e | 24541 | 9.1% |
| t | 23397 | 8.6% |
| s | 21055 | 7.8% |
| a | 17998 | 6.7% |
| r | 14003 | 5.2% |
| o | 12440 | 4.6% |
| E | 11848 | 4.4% |
| A | 10106 | 3.7% |
| i | 9697 | 3.6% |
| Other values (33) | 96431 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 162451 | |
| Uppercase Letter | 65050 | |
| Space Separator | 29024 | 10.7% |
| Open Punctuation | 6677 | 2.5% |
| Close Punctuation | 6677 | 2.5% |
| Dash Punctuation | 661 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 24541 | |
| t | 23397 | |
| s | 21055 | |
| a | 17998 | |
| r | 14003 | |
| o | 12440 | |
| i | 9697 | 6.0% |
| l | 6578 | 4.0% |
| h | 5996 | 3.7% |
| n | 5527 | 3.4% |
| Other values (11) | 21219 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 11848 | |
| A | 10106 | |
| N | 9262 | |
| C | 7360 | |
| W | 5875 | |
| D | 5201 | |
| H | 3127 | 4.8% |
| M | 2865 | 4.4% |
| P | 2537 | 3.9% |
| B | 1845 | 2.8% |
| Other values (8) | 5024 |
Space Separator
| Value | Count | Frequency (%) |
| 29024 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6677 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6677 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 661 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 227501 | |
| Common | 43039 | 15.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 24541 | 10.8% |
| t | 23397 | 10.3% |
| s | 21055 | 9.3% |
| a | 17998 | 7.9% |
| r | 14003 | 6.2% |
| o | 12440 | 5.5% |
| E | 11848 | 5.2% |
| A | 10106 | 4.4% |
| i | 9697 | 4.3% |
| N | 9262 | 4.1% |
| Other values (29) | 73154 |
Common
| Value | Count | Frequency (%) |
| 29024 | ||
| ( | 6677 | 15.5% |
| ) | 6677 | 15.5% |
| - | 661 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 270540 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 29024 | 10.7% | |
| e | 24541 | 9.1% |
| t | 23397 | 8.6% |
| s | 21055 | 7.8% |
| a | 17998 | 6.7% |
| r | 14003 | 5.2% |
| o | 12440 | 4.6% |
| E | 11848 | 4.4% |
| A | 10106 | 3.7% |
| i | 9697 | 3.6% |
| Other values (33) | 96431 |
| Distinct | 57 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 46690 |
| Missing (%) | 59.8% |
| Memory size | 609.8 KiB |
| Northeast EA (West) | |
|---|---|
| Gateway EA (East) | |
| Dixie EA | |
| Meadowvale Business Park CC | |
| Western Business Park EA | |
| Other values (52) |
Length
| Max length | 27 |
|---|---|
| Median length | 23 |
| Mean length | 16.534633 |
| Min length | 7 |
Characters and Unicode
| Total characters | 518245 |
|---|---|
| Distinct characters | 44 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Northeast EA (West) |
|---|---|
| 2nd row | DT Core |
| 3rd row | Northeast EA (West) |
| 4th row | DT Core |
| 5th row | DT Core |
Common Values
| Value | Count | Frequency (%) |
| Northeast EA (West) | 8989 | 11.5% |
| Gateway EA (East) | 1975 | 2.5% |
| Dixie EA | 1955 | 2.5% |
| Meadowvale Business Park CC | 1898 | 2.4% |
| Western Business Park EA | 1636 | 2.1% |
| DT Core | 1477 | 1.9% |
| Airport CC | 996 | 1.3% |
| Northeast EA (East) | 804 | 1.0% |
| Mavis-Erindale EA | 784 | 1.0% |
| DT Cooksville | 724 | 0.9% |
| Other values (47) | 10105 | 12.9% |
| (Missing) | 46690 |
Length
| Value | Count | Frequency (%) |
| ea | 17070 | |
| northeast | 9793 | 11.3% |
| west | 9630 | 11.1% |
| nhd | 5337 | 6.1% |
| park | 3923 | 4.5% |
| east | 3694 | 4.2% |
| business | 3534 | 4.1% |
| cc | 3445 | 4.0% |
| gateway | 2875 | 3.3% |
| dt | 2519 | 2.9% |
| Other values (48) | 25104 |
Most occurring characters
| Value | Count | Frequency (%) |
| 55581 | 10.7% | |
| e | 47046 | 9.1% |
| t | 44934 | 8.7% |
| s | 40159 | 7.7% |
| a | 34746 | 6.7% |
| r | 27014 | 5.2% |
| o | 23860 | 4.6% |
| E | 22566 | 4.4% |
| A | 19328 | 3.7% |
| i | 18277 | 3.5% |
| Other values (34) | 184734 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 311221 | |
| Uppercase Letter | 124590 | |
| Space Separator | 55581 | 10.7% |
| Close Punctuation | 12769 | 2.5% |
| Open Punctuation | 12769 | 2.5% |
| Dash Punctuation | 1315 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 47046 | |
| t | 44934 | |
| s | 40159 | |
| a | 34746 | |
| r | 27014 | |
| o | 23860 | |
| i | 18277 | 5.9% |
| l | 12367 | 4.0% |
| h | 11504 | 3.7% |
| n | 10684 | 3.4% |
| Other values (12) | 40630 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 22566 | |
| A | 19328 | |
| N | 17803 | |
| C | 14198 | |
| W | 11322 | |
| D | 9811 | |
| H | 5878 | 4.7% |
| M | 5574 | 4.5% |
| P | 4873 | 3.9% |
| B | 3534 | 2.8% |
| Other values (8) | 9703 |
Space Separator
| Value | Count | Frequency (%) |
| 55581 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 12769 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 12769 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1315 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 435811 | |
| Common | 82434 | 15.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 47046 | 10.8% |
| t | 44934 | 10.3% |
| s | 40159 | 9.2% |
| a | 34746 | 8.0% |
| r | 27014 | 6.2% |
| o | 23860 | 5.5% |
| E | 22566 | 5.2% |
| A | 19328 | 4.4% |
| i | 18277 | 4.2% |
| N | 17803 | 4.1% |
| Other values (30) | 140078 |
Common
| Value | Count | Frequency (%) |
| 55581 | ||
| ) | 12769 | 15.5% |
| ( | 12769 | 15.5% |
| - | 1315 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 518245 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 55581 | 10.7% | |
| e | 47046 | 9.1% |
| t | 44934 | 8.7% |
| s | 40159 | 7.7% |
| a | 34746 | 6.7% |
| r | 27014 | 5.2% |
| o | 23860 | 4.6% |
| E | 22566 | 4.4% |
| A | 19328 | 3.7% |
| i | 18277 | 3.5% |
| Other values (34) | 184734 |
| Distinct | 189 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 63218 |
| Missing (%) | 81.0% |
| Memory size | 609.8 KiB |
| 2018/12/30 00:00:00+00 | |
|---|---|
| 2019/12/12 00:00:00+00 | |
| 2019/09/19 00:00:00+00 | |
| 2017/11/09 00:00:00+00 | |
| 2017/11/08 00:00:00+00 | |
| Other values (184) |
Length
| Max length | 22 |
|---|---|
| Median length | 22 |
| Mean length | 22 |
| Min length | 22 |
Characters and Unicode
| Total characters | 325930 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 50 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 2021/06/25 00:00:00+00 |
|---|---|
| 2nd row | 2021/06/03 00:00:00+00 |
| 3rd row | 2021/07/15 00:00:00+00 |
| 4th row | 2021/07/15 00:00:00+00 |
| 5th row | 2021/07/15 00:00:00+00 |
Common Values
| Value | Count | Frequency (%) |
| 2018/12/30 00:00:00+00 | 2771 | 3.6% |
| 2019/12/12 00:00:00+00 | 1848 | 2.4% |
| 2019/09/19 00:00:00+00 | 1586 | 2.0% |
| 2017/11/09 00:00:00+00 | 1111 | 1.4% |
| 2017/11/08 00:00:00+00 | 968 | 1.2% |
| 2021/07/02 00:00:00+00 | 354 | 0.5% |
| 2019/06/07 00:00:00+00 | 267 | 0.3% |
| 2021/05/21 00:00:00+00 | 186 | 0.2% |
| 2018/09/30 00:00:00+00 | 177 | 0.2% |
| 2021/05/17 00:00:00+00 | 168 | 0.2% |
| Other values (179) | 5379 | 6.9% |
| (Missing) | 63218 |
Length
| Value | Count | Frequency (%) |
| 00:00:00+00 | 14815 | |
| 2018/12/30 | 2771 | 9.4% |
| 2019/12/12 | 1848 | 6.2% |
| 2019/09/19 | 1586 | 5.4% |
| 2017/11/09 | 1111 | 3.7% |
| 2017/11/08 | 968 | 3.3% |
| 2021/07/02 | 354 | 1.2% |
| 2019/06/07 | 267 | 0.9% |
| 2021/05/21 | 186 | 0.6% |
| 2018/09/30 | 177 | 0.6% |
| Other values (180) | 5547 | 18.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 148805 | |
| 1 | 29895 | 9.2% |
| / | 29630 | 9.1% |
| : | 29630 | 9.1% |
| 2 | 29181 | 9.0% |
| 14815 | 4.5% | |
| + | 14815 | 4.5% |
| 9 | 8963 | 2.7% |
| 7 | 6006 | 1.8% |
| 8 | 5090 | 1.6% |
| Other values (4) | 9100 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 237040 | |
| Other Punctuation | 59260 | 18.2% |
| Space Separator | 14815 | 4.5% |
| Math Symbol | 14815 | 4.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 148805 | |
| 1 | 29895 | 12.6% |
| 2 | 29181 | 12.3% |
| 9 | 8963 | 3.8% |
| 7 | 6006 | 2.5% |
| 8 | 5090 | 2.1% |
| 3 | 3797 | 1.6% |
| 6 | 2508 | 1.1% |
| 5 | 2286 | 1.0% |
| 4 | 509 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 29630 | |
| : | 29630 |
Space Separator
| Value | Count | Frequency (%) |
| 14815 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 14815 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 325930 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 148805 | |
| 1 | 29895 | 9.2% |
| / | 29630 | 9.1% |
| : | 29630 | 9.1% |
| 2 | 29181 | 9.0% |
| 14815 | 4.5% | |
| + | 14815 | 4.5% |
| 9 | 8963 | 2.7% |
| 7 | 6006 | 1.8% |
| 8 | 5090 | 1.6% |
| Other values (4) | 9100 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 325930 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 148805 | |
| 1 | 29895 | 9.2% |
| / | 29630 | 9.1% |
| : | 29630 | 9.1% |
| 2 | 29181 | 9.0% |
| 14815 | 4.5% | |
| + | 14815 | 4.5% |
| 9 | 8963 | 2.7% |
| 7 | 6006 | 1.8% |
| 8 | 5090 | 1.6% |
| Other values (4) | 9100 | 2.8% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 63208 |
| Missing (%) | 81.0% |
| Memory size | 609.8 KiB |
| CK | 443 |
|---|---|
| MLT | 362 |
| PC | 304 |
| STR | 215 |
Length
| Max length | 3 |
|---|---|
| Median length | 1 |
| Mean length | 1.1399663 |
| Min length | 1 |
Characters and Unicode
| Total characters | 16900 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| 13414 | 17.2% | |
| CK | 443 | 0.6% |
| MLT | 362 | 0.5% |
| PC | 304 | 0.4% |
| STR | 215 | 0.3% |
| CLV | 87 | 0.1% |
| (Missing) | 63208 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ck | 443 | |
| mlt | 362 | |
| pc | 304 | |
| str | 215 | |
| clv | 87 | 6.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 13414 | ||
| C | 834 | 4.9% |
| T | 577 | 3.4% |
| L | 449 | 2.7% |
| K | 443 | 2.6% |
| M | 362 | 2.1% |
| P | 304 | 1.8% |
| S | 215 | 1.3% |
| R | 215 | 1.3% |
| V | 87 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 13414 | |
| Uppercase Letter | 3486 | 20.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 834 | |
| T | 577 | |
| L | 449 | |
| K | 443 | |
| M | 362 | |
| P | 304 | 8.7% |
| S | 215 | 6.2% |
| R | 215 | 6.2% |
| V | 87 | 2.5% |
Space Separator
| Value | Count | Frequency (%) |
| 13414 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13414 | |
| Latin | 3486 | 20.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 834 | |
| T | 577 | |
| L | 449 | |
| K | 443 | |
| M | 362 | |
| P | 304 | 8.7% |
| S | 215 | 6.2% |
| R | 215 | 6.2% |
| V | 87 | 2.5% |
Common
| Value | Count | Frequency (%) |
| 13414 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16900 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 13414 | ||
| C | 834 | 4.9% |
| T | 577 | 3.4% |
| L | 449 | 2.7% |
| K | 443 | 2.6% |
| M | 362 | 2.1% |
| P | 304 | 1.8% |
| S | 215 | 1.3% |
| R | 215 | 1.3% |
| V | 87 | 0.5% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 63208 |
| Missing (%) | 81.0% |
| Memory size | 609.8 KiB |
| Cooksville BIA | 443 |
|---|---|
| Malton BIA | 362 |
| Port Credit BIA | 304 |
| Streetsville BIA | 215 |
Length
| Max length | 16 |
|---|---|
| Median length | 1 |
| Mean length | 2.177403 |
| Min length | 1 |
Characters and Unicode
| Total characters | 32280 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| 13414 | 17.2% | |
| Cooksville BIA | 443 | 0.6% |
| Malton BIA | 362 | 0.5% |
| Port Credit BIA | 304 | 0.4% |
| Streetsville BIA | 215 | 0.3% |
| Clarkson BIA | 87 | 0.1% |
| (Missing) | 63208 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| bia | 1411 | |
| cooksville | 443 | 14.2% |
| malton | 362 | 11.6% |
| port | 304 | 9.7% |
| credit | 304 | 9.7% |
| streetsville | 215 | 6.9% |
| clarkson | 87 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 15129 | ||
| l | 1765 | 5.5% |
| o | 1639 | 5.1% |
| A | 1411 | 4.4% |
| B | 1411 | 4.4% |
| I | 1411 | 4.4% |
| t | 1400 | 4.3% |
| e | 1392 | 4.3% |
| i | 962 | 3.0% |
| r | 910 | 2.8% |
| Other values (10) | 4850 | 15.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 15129 | |
| Lowercase Letter | 11203 | |
| Uppercase Letter | 5948 | 18.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 1765 | |
| o | 1639 | |
| t | 1400 | |
| e | 1392 | |
| i | 962 | |
| r | 910 | |
| s | 745 | |
| v | 658 | 5.9% |
| k | 530 | 4.7% |
| a | 449 | 4.0% |
| Other values (2) | 753 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1411 | |
| B | 1411 | |
| I | 1411 | |
| C | 834 | |
| M | 362 | 6.1% |
| P | 304 | 5.1% |
| S | 215 | 3.6% |
Space Separator
| Value | Count | Frequency (%) |
| 15129 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17151 | |
| Common | 15129 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 1765 | |
| o | 1639 | |
| A | 1411 | 8.2% |
| B | 1411 | 8.2% |
| I | 1411 | 8.2% |
| t | 1400 | 8.2% |
| e | 1392 | 8.1% |
| i | 962 | 5.6% |
| r | 910 | 5.3% |
| C | 834 | 4.9% |
| Other values (9) | 4016 |
Common
| Value | Count | Frequency (%) |
| 15129 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32280 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 15129 | ||
| l | 1765 | 5.5% |
| o | 1639 | 5.1% |
| A | 1411 | 4.4% |
| B | 1411 | 4.4% |
| I | 1411 | 4.4% |
| t | 1400 | 4.3% |
| e | 1392 | 4.3% |
| i | 962 | 3.0% |
| r | 910 | 2.8% |
| Other values (10) | 4850 | 15.0% |
RecordID
Real number (ℝ)
| Distinct | 21240 |
|---|---|
| Distinct (%) | 27.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34656.92 |
| Minimum | 2 |
|---|---|
| Maximum | 94424 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2230 |
| Q1 | 9764 |
| median | 19183 |
| Q3 | 55026 |
| 95-th percentile | 88915 |
| Maximum | 94424 |
| Range | 94422 |
| Interquartile range (IQR) | 45262 |
Descriptive statistics
| Standard deviation | 29857.678 |
|---|---|
| Coefficient of variation (CV) | 0.8615214 |
| Kurtosis | -0.9937126 |
| Mean | 34656.92 |
| Median Absolute Deviation (MAD) | 16020 |
| Skewness | 0.65053975 |
| Sum | 2.7043834 × 109 |
| Variance | 8.9148093 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 85606 | 6 | < 0.1% |
| 1055 | 5 | < 0.1% |
| 19338 | 5 | < 0.1% |
| 19580 | 5 | < 0.1% |
| 20871 | 5 | < 0.1% |
| 19831 | 5 | < 0.1% |
| 19332 | 5 | < 0.1% |
| 19583 | 5 | < 0.1% |
| 19832 | 5 | < 0.1% |
| 19584 | 5 | < 0.1% |
| Other values (21230) | 77982 |
| Value | Count | Frequency (%) |
| 2 | 2 | < 0.1% |
| 7 | 5 | |
| 10 | 5 | |
| 12 | 3 | |
| 16 | 5 | |
| 18 | 5 | |
| 20 | 5 | |
| 21 | 5 | |
| 23 | 5 | |
| 26 | 4 |
| Value | Count | Frequency (%) |
| 94424 | 1 | |
| 94423 | 1 | |
| 94419 | 1 | |
| 94371 | 1 | |
| 94321 | 1 | |
| 94319 | 1 | |
| 94318 | 1 | |
| 94317 | 1 | |
| 94313 | 1 | |
| 94293 | 1 |
Closed
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| 0 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 78033 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 78033 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 78033 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 78033 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 78033 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 78033 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 78033 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 78033 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 78033 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 78033 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.| X | Y | FID | BusinessID | Name | Address | StreetNo | StreetName | BldgNo | UnitNo | PostalCode | Location | Ward | NAICSCode | NAICSCat | NAICSDescr | Phone | Fax | TollFree | WebAddress | EmplRange | EmplUpdate | Sector_Des | CENT_X | CENT_Y | Year | PIN | Character | CHArea | Modified | BIA_NAME | BIAFulName | RecordID | Closed | isnew | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | -79.689829 | 43.644181 | 1 | 1055 | Golf Trends Inc. | 300 Ambassador Dr | 300 | Ambassador Dr | L5T 2J3 | Gateway EA (East) | 5 | 414470 | Wholesale | Amusement and Sporting Goods Wholesaler-Distributors | 905-795-8900 | 905-795-8988 | 1-800-668-1101 | lfinch@golftrendsinc.com | www.golftrendsinc.com | 10 to 19 | 2015/10/31 00:00:00+00 | 605668.2538 | 4.833187e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1055 | 0 | True | |||
| 1 | -79.689419 | 43.644988 | 2 | 1057 | Apex Graphics Inc. | 320 Ambassador Dr | 320 | Ambassador Dr | L5T 2J3 | Gateway EA (East) | 5 | 323120 | Manufacturing | Support Activities for Printing | 905-795-9575 | 905-795-8775 | prepress@apexgraphics.com | www.apexgraphics.com | 20 to 49 | 2016/10/31 00:00:00+00 | 605699.9370 | 4.833277e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1057 | 0 | True | ||||
| 2 | -79.689419 | 43.644988 | 3 | 1058 | Sands, John & Associates Limited | 320 Ambassador Dr | 320 | Ambassador Dr | L5T 2J3 | Gateway EA (East) | 5 | 323120 | Manufacturing | Support Activities for Printing | 905-795-9519 | 905-795-8775 | 50 to 99 | 2015/10/31 00:00:00+00 | 605699.9370 | 4.833277e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1058 | 0 | True | ||||||
| 3 | -79.689419 | 43.644988 | 4 | 1060 | Printmedia-Tackaberry Times | 320 Ambassador Dr | 320 | Ambassador Dr | L5T 2J3 | Gateway EA (East) | 5 | 323119 | Manufacturing | Other Printing | 905-564-8121 | 905-564-7395 | info@printmedia.ca | www.printmedia.ca | 1 to 4 | 2015/10/31 00:00:00+00 | 605699.9370 | 4.833277e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1060 | 0 | True | ||||
| 4 | -79.690664 | 43.645493 | 5 | 1061 | S W R Industries Ltd. | 321 Ambassador Dr | 321 | Ambassador Dr | L5T 2J3 | Gateway EA (East) | 5 | 417230 | Wholesale | Industrial Machinery, Equipment and Supplies Wholesaler-Distributors | 905-564-8080 | 905-564-5003 | shsieh@swrltd.com | www.swrltd.com | 5 to 9 | 2015/10/31 00:00:00+00 | 605598.6442 | 4.833332e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1061 | 0 | True | ||||
| 5 | -79.690277 | 43.646372 | 6 | 1063 | Crossdock Freight Solutions | 361 Ambassador Dr | 361 | Ambassador Dr | L5T 2J3 | Gateway EA (East) | 5 | 488519 | Transportation | Other Freight Transportation Arrangement | 905-670-4937 | 905-670-9475 | customerassist@crossdocksystems.com | www.crossdockfreight.com | 20 to 49 | 2015/10/31 00:00:00+00 | 605628.2838 | 4.833430e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1063 | 0 | True | ||||
| 6 | -79.689877 | 43.646914 | 7 | 1065 | Green Belting Industries Ltd. | 381 Ambassador Dr | 381 | Ambassador Dr | L5T 2J3 | Gateway EA (East) | 5 | 325510 | Manufacturing | Paint and Coating Manufacturing | 905-564-6712 | 905-564-6709 | 1-800-668-1114 | customerservice@greenbelting.com | www.greenbelting.com | 50 to 99 | 2016/10/31 00:00:00+00 | 605659.5646 | 4.833490e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1065 | 0 | True | |||
| 7 | -79.634279 | 43.640404 | 8 | 1073 | Dafco Filtration Group Corporation | 5390 Ambler Dr | 5390 | Ambler Dr | B | L4W 1G9 | Northeast EA (West) | 5 | 333413 | Manufacturing | Industrial and Commercial Fan and Blower and Air Purification Equipment Manufacturing | 905-602-1010 | 905-629-1124 | info@dafcofiltrationgroup.com | www.dafco.ca | 50 to 99 | 2016/10/31 00:00:00+00 | 610155.4182 | 4.832840e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1073 | 0 | True | |||
| 8 | -79.632844 | 43.641337 | 9 | 1074 | Ace Trans Inc. | 5391 Ambler Dr | 5391 | Ambler Dr | 1 | L4W 1H1 | Northeast EA (West) | 5 | 493110 | Transportation | General Warehousing and Storage | 905-625-3000 | 905-625-6049 | info@acetrans.ca | www.acetrans.ca | 1 to 4 | 2016/10/31 00:00:00+00 | 610269.4640 | 4.832945e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1074 | 0 | True | |||
| 9 | -79.637815 | 43.642638 | 10 | 1077 | Petro Maxx | 5510 Ambler Dr | 5510 | Ambler Dr | 1 to 2 | L4W 2V1 | Northeast EA (West) | 5 | 541490 | Professional | Other Specialized Design Services | 905-206-0040 | blake@petromaxx.ca | www.maxxgroupofcompanies.ca | 20 to 49 | 2015/10/31 00:00:00+00 | 609866.1452 | 4.833083e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1077 | 0 | True |
| X | Y | FID | BusinessID | Name | Address | StreetNo | StreetName | BldgNo | UnitNo | PostalCode | Location | Ward | NAICSCode | NAICSCat | NAICSDescr | Phone | Fax | TollFree | WebAddress | EmplRange | EmplUpdate | Sector_Des | CENT_X | CENT_Y | Year | PIN | Character | CHArea | Modified | BIA_NAME | BIAFulName | RecordID | Closed | isnew | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 78023 | 608544.3664 | 4.840490e+06 | 14816 | 57550 | Advance Car & Truck Rental | 2960 Drew Rd | 2960 | Drew Rd | 149 | L4T 0A5 | NaN | 5 | 532111 | Real Estate | Passenger Car Rental | 905-461-7368 | 905-461-6666 | 1-877-303-7368 | Advancerental@gmail.com | www.advancerental.ca | 1 to 4 | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2021/06/22 00:00:00+00 | MLT | Malton BIA | 57550 | 0 | False | |
| 78024 | 608544.3664 | 4.840490e+06 | 14817 | 57551 | Video Palace | 2960 Drew Rd | 2960 | Drew Rd | 150 | L4T 0A5 | NaN | 5 | 532280 | Real Estate | All Other Consumer Goods Rental | 905-678-7878 | 1 to 4 | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2021/06/02 00:00:00+00 | MLT | Malton BIA | 57551 | 0 | False | |||||
| 78025 | 608544.3664 | 4.840490e+06 | 14818 | 57552 | Secure Life Insurance Agency Inc. | 2960 Drew Rd | 2960 | Drew Rd | 151 | L4T 0A5 | NaN | 5 | 524112 | Finance | Direct Group Life, Health and Medical Insurance Carriers | 1-800-746-9122 | www.securelifeinsurance.ca | NaN | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2018/12/30 00:00:00+00 | MLT | Malton BIA | 57552 | 0 | False | ||||
| 78026 | 608544.3664 | 4.840490e+06 | 14819 | 57555 | Skillman Flooring | 2960 Drew Rd | 2960 | Drew Rd | 155&157B | L4T 0A5 | NaN | 5 | 442210 | Retail | Floor Covering Stores | 905-676-9111 | 905-676-9113 | skillmanflooring@live.ca | www.skillmanflooring.com | 1 to 4 | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2019/12/12 00:00:00+00 | MLT | Malton BIA | 57555 | 0 | False | ||
| 78027 | 608544.3664 | 4.840490e+06 | 14820 | 57557 | Verma Vastar Manufacturing Inc. | 2960 Drew Rd | 2960 | Drew Rd | 160 | L4T 0A5 | NaN | 5 | 315210 | Manufacturing | Cut and Sew Clothing Contracting | 647-669-4545 | 1 to 4 | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2018/12/30 00:00:00+00 | MLT | Malton BIA | 57557 | 0 | False | |||||
| 78028 | 608544.3664 | 4.840490e+06 | 14821 | 60142 | JobsForU | 2960 Drew Rd | 2960 | Drew Rd | 156 | L4T 0A5 | NaN | 5 | 561310 | Administrative | Employment Placement Agencies and Executive Search Services | 416-825-4000 | navjot@jobsforu.ca | www.jobsforu.ca | 10 to 19 | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2021/07/30 00:00:00+00 | MLT | Malton BIA | 60142 | 0 | True | |||
| 78029 | 608544.3664 | 4.840490e+06 | 14822 | 60159 | Elite Source Solutions | 2980 Drew Rd | 2980 | Drew Rd | 133 | L4T 0A7 | NaN | 5 | 561310 | Administrative | Employment Placement Agencies and Executive Search Services | 905-598-3542 | NaN | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2018/12/30 00:00:00+00 | MLT | Malton BIA | 60159 | 0 | True | |||||
| 78030 | 608544.3664 | 4.840490e+06 | 14823 | 60160 | Indian Sweet Master | 2980 Drew Rd | 2980 | Drew Rd | 134 | L4T 0A7 | NaN | 5 | 722511 | Accommodation | Full-service restaurants | 905-405-8585 | NaN | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2018/12/30 00:00:00+00 | MLT | Malton BIA | 60160 | 0 | True | |||||
| 78031 | 608544.3664 | 4.840490e+06 | 14824 | 60161 | Mississauga Flooring & Supplies Inc. | 2980 Drew Rd | 2980 | Drew Rd | 135 & 136 | L4T 0A7 | NaN | 5 | 414320 | Wholesale | Floor Covering Wholesaler-Distributors | 905-460-7005 | 1 to 4 | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2021/08/16 00:00:00+00 | MLT | Malton BIA | 60161 | 0 | True | |||||
| 78032 | 608544.3664 | 4.840490e+06 | 14825 | 60162 | Punjabi Textile Ltd. | 2980 Drew Rd | 2980 | Drew Rd | 132 | L4T 0A7 | NaN | 5 | 414110 | Wholesale | Clothing and Clothing Accessories Wholesaler-Distributors | 905-405-1919 | NaN | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2018/12/30 00:00:00+00 | MLT | Malton BIA | 60162 | 0 | True |